P.1.b.003 Supervised machine learning algorithms predict “correct” classification of retinal ganglia neuron subtypes

We compare two supervised machine learning algorithms, Multinomial Naïve Bayes and Gradient Boosting, to classify social science articles using textual data. The high granularity of the classification scheme and the possibility that multiple categories are assigned to a document make this task challenging. To collect the training data, we query three discipline-specific thesauri to retrieve articles corresponding to specialties in the classification. The resulting dataset consists of 113,909 records and covers 245 specialties, aggregated into 31 subdisciplines from three disciplines. Experts were consulted to validate the thesaurus-based classification. The resulting multi-label dataset is used to train the machine learning algorithms in different configurations. We deploy a multi-label classifier chaining model, allowing for an arbitrary number of categories to be assigned to each document. The best results are obtained with Gradient Boosting. The approach does not rely on citation data, so it can be applied in settings where such information is not available. We conclude that fine-grained text-based classification of social science publications at a subdisciplinary level is a hard task, for humans and machines alike. A combination of human expertise and machine learning is suggested as a way forward to improve the classification of social science documents.
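The classifier chaining idea mentioned in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the class names `BinaryMultinomialNB` and `ClassifierChain`, the two labels (sociology, economics), and the five toy documents are all hypothetical, and Multinomial Naïve Bayes is used as the base learner only because it is short to write out. Each link in the chain is a binary classifier that sees the document tokens plus the labels decided by earlier links, which is what lets any number of categories be assigned to one document.

```python
import math
from collections import Counter


class BinaryMultinomialNB:
    """Multinomial Naive Bayes for a single yes/no label (Laplace smoothing)."""

    def fit(self, docs, labels):
        # docs: list of token lists; labels: list of 0/1 values
        self.vocab = {tok for doc in docs for tok in doc}
        self.counts = {0: Counter(), 1: Counter()}
        self.n_docs = Counter(labels)
        for doc, y in zip(docs, labels):
            self.counts[y].update(doc)
        self.totals = {c: sum(self.counts[c].values()) for c in (0, 1)}
        return self

    def _log_score(self, doc, c):
        # Smoothed log prior plus smoothed log likelihood of each known token
        n = sum(self.n_docs.values())
        score = math.log((self.n_docs[c] + 1) / (n + 2))
        denom = self.totals[c] + len(self.vocab)
        for tok in doc:
            if tok in self.vocab:
                score += math.log((self.counts[c][tok] + 1) / denom)
        return score

    def predict(self, doc):
        return int(self._log_score(doc, 1) > self._log_score(doc, 0))


class ClassifierChain:
    """One binary model per label; model j also sees labels 0..j-1."""

    def __init__(self, n_labels):
        self.models = [BinaryMultinomialNB() for _ in range(n_labels)]

    @staticmethod
    def _augment(doc, previous):
        # Earlier labels are appended to the document as pseudo-tokens
        return list(doc) + [f"__label{i}={v}" for i, v in enumerate(previous)]

    def fit(self, docs, label_rows):
        for j, model in enumerate(self.models):
            # Training uses the true earlier labels
            augmented = [self._augment(d, row[:j]) for d, row in zip(docs, label_rows)]
            model.fit(augmented, [row[j] for row in label_rows])
        return self

    def predict(self, doc):
        predicted = []
        for model in self.models:
            # Prediction feeds each link the labels predicted so far
            predicted.append(model.predict(self._augment(doc, predicted)))
        return predicted


# Toy data, invented for illustration; label order is [sociology, economics]
docs = [
    "labour market wages".split(),
    "social class inequality".split(),
    "wage inequality class".split(),
    "market prices inflation".split(),
    "family social norms".split(),
]
labels = [[0, 1], [1, 0], [1, 1], [0, 1], [1, 0]]

chain = ClassifierChain(n_labels=2).fit(docs, labels)
print(chain.predict("prices inflation".split()))
print(chain.predict("social norms family".split()))
```

The order of labels in the chain is a design choice: later links can exploit correlations with earlier labels, but errors made early also propagate down the chain.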


Application of supervised machine learning algorithms in the classification of sagittal gait patterns of cerebral palsy children with spastic diplegia

Computers in Biology and Medicine ◽

10.1016/j.compbiomed.2019.01.009 ◽

2019 ◽

Vol 106 ◽

pp. 33-39 ◽

Cited By ~ 13

Author(s):

Yanxin Zhang ◽

Ye Ma

Keyword(s):

Machine Learning ◽

Cerebral Palsy ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Spastic Diplegia ◽

Gait Patterns


Advanced Supervised Machine Learning Algorithms for Efficient Electrofacies Classification of a Carbonate Reservoir in a Giant Southern Iraqi Oil Field

10.4043/30906-ms ◽

2020 ◽

Cited By ~ 1

Author(s):

Watheq J Al-Mudhafar

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Carbonate Reservoir ◽

Oil Field ◽

Machine Learning Algorithms ◽

Supervised Machine Learning


Application of supervised machine learning algorithms for the classification of regulatory RNA riboswitches

Briefings in Functional Genomics ◽

10.1093/bfgp/elw005 ◽

2016 ◽

pp. elw005 ◽

Cited By ~ 5

Author(s):

Swadha Singh ◽

Raghvendra Singh

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Regulatory RNA


Classification of Benign and Malignant Breast Cancer using Supervised Machine Learning Algorithms Based on Image and Numeric Datasets

Journal of Physics Conference Series ◽

10.1088/1742-6596/1372/1/012062 ◽

2019 ◽

Vol 1372 ◽

pp. 012062

Author(s):

Ratula Ray ◽

Azian Azamimi Abdullah ◽

Debasish Kumar Mallick ◽

Satya Ranjan Dash

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Malignant Breast


Binary Classification of Celestial Bodies Using Supervised Machine Learning Algorithms

Algorithms for Intelligent Systems - Proceedings of International Conference on Machine Intelligence and Data Science Applications ◽

10.1007/978-981-33-4087-9_42 ◽

2021 ◽

pp. 495-505

Author(s):

Anwesha Ujjwal Barman ◽

Kritika Shah ◽

Kanchan Lata Kashyap ◽

Avanish Sandilya ◽

Nishq Poorav Desai

Keyword(s):

Machine Learning ◽

Binary Classification ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Celestial Bodies


Classification of Firewall Logs Using Supervised Machine Learning Algorithms

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v7i8.301304 ◽

2019 ◽

Vol 7 (8) ◽

pp. 301-304

Author(s):

Hajar Esmaeil As-Suhbani ◽

S.D. Khamitkar

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Supervised Machine Learning


A Comparison of Supervised Machine Learning Algorithms for Classification of Communications Network Traffic

Neural Information Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-319-70087-8_47 ◽

2017 ◽

pp. 445-454 ◽

Cited By ~ 10

Author(s):

Pramitha Perera ◽

Yu-Chu Tian ◽

Colin Fidge ◽

Wayne Kelly

Keyword(s):

Machine Learning ◽

Network Traffic ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Communications Network


Annotation-free Learning of Plankton for Classification and Anomaly Detection

10.1101/856815 ◽

2019 ◽

Author(s):

Vito P. Pastore ◽

Thomas G. Zimmerman ◽

Sujoy Biswas ◽

Simone Bianco

Keyword(s):

Machine Learning ◽

Global Climate ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Marine Phytoplankton ◽

Supervised Machine Learning ◽

Woods Hole Oceanographic Institution ◽

Accurate Detection ◽

Plankton Species

Abstract: The acquisition of increasingly large plankton digital image datasets requires automatic methods of recognition and classification. As data size and collection speed increase, manual annotation and database representation are often bottlenecks for the use of machine learning algorithms for taxonomic classification of plankton species in field studies. In this paper we present a novel set of algorithms to perform accurate detection and classification of plankton species with minimal supervision. Our algorithms approach the performance of existing supervised machine learning algorithms when tested on a plankton dataset generated from a custom-built lensless digital device. Similar results are obtained on a larger image dataset obtained from the Woods Hole Oceanographic Institution. Our algorithms are designed to provide a new way to monitor the environment with a class of rapid online intelligent detectors.

Author Summary: Plankton are at the bottom of the aquatic food chain, and marine phytoplankton are estimated to be responsible for over 50% of all global primary production [1] and play a fundamental role in climate regulation. Thus, changes in plankton ecology may have a profound impact on global climate, as well as deep social and economic consequences. It therefore seems paramount to collect and analyze real-time plankton data to understand the relationship between the health of plankton and the health of the environment they live in. In this paper, we present a novel set of algorithms to perform accurate detection and classification of plankton species with minimal supervision. The proposed pipeline is designed to provide a new way to monitor the environment with a class of rapid online intelligent detectors.


Classification of Instagram fake users using supervised machine learning algorithms

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v10i3.pp2763-2772 ◽

2020 ◽

Vol 10 (3) ◽

pp. 2763 ◽

Cited By ~ 1

Author(s):

Kristo Radion Purba ◽

David Asirvatham ◽

Raja Kumar Murugesan

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Descriptive Statistics ◽

Supervised Machine Learning ◽

Business Owner ◽

Random Forest Algorithm ◽

Link Availability ◽

Machine Learning Models

On Instagram, the number of followers is a common success indicator. Hence, follower-selling services have become a large part of the market. Influencers are bombarded with fake followers, which causes a business owner to pay more than they should for a brand endorsement. Identifying fake followers is therefore important for determining the authenticity of an influencer. This research aims to identify fake users' behavior, and proposes supervised machine learning models to classify authentic and fake users. The dataset contains fake users bought from various sources, and authentic users. There are 17 features used, based on these sources: 6 metadata, 3 media info, 2 engagement, 2 media tags, 4 media similarity. Five machine learning algorithms are tested. Three different approaches to classification are proposed, i.e. classification into 2 classes and 4 classes, and classification with metadata. The random forest algorithm produces the highest accuracy for the 2-class (authentic, fake) and 4-class (authentic, active fake user, inactive fake user, spammer) classification, with accuracy up to 91.76%. The results also show that the five metadata variables, i.e. number of posts, followers, biography length, following, and link availability, are the biggest predictors of the user class. Additionally, descriptive statistics reveal noticeable differences between fake and authentic users.
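The relation between the 2-class and 4-class settings described above can be sketched in a few lines. The class names come from the abstract, but the mapping `FOUR_TO_TWO` (every non-authentic class counted as "fake") is an assumption, and the label lists are invented purely to show how accuracy is computed in each setting; they are not results from the paper.

```python
# Assumed mapping: every non-authentic class counts as "fake" in the 2-class view
FOUR_TO_TWO = {
    "authentic": "authentic",
    "active fake user": "fake",
    "inactive fake user": "fake",
    "spammer": "fake",
}


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("label lists must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def collapse(labels):
    """Map 4-class labels onto the 2-class scheme."""
    return [FOUR_TO_TWO[label] for label in labels]


# Invented labels for illustration only
y_true = ["authentic", "spammer", "active fake user", "inactive fake user"]
y_pred = ["authentic", "active fake user", "active fake user", "authentic"]

acc4 = accuracy(y_true, y_pred)                      # 4-class accuracy
acc2 = accuracy(collapse(y_true), collapse(y_pred))  # 2-class accuracy
print(acc4, acc2)
```

Under such a mapping, confusing one kind of fake user with another counts as an error in the 4-class setting but not in the 2-class setting, so the two accuracies are not directly comparable.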
