Application of Machine Learning for Cytometry Data

Frontiers in Immunology ◽

10.3389/fimmu.2021.787574 ◽

2022 ◽

Vol 12 ◽

Author(s):

Zicheng Hu ◽

Sanchita Bhattacharya ◽

Atul J. Butte

Keyword(s):

Machine Learning ◽

Large Cell ◽

Machine Learning Techniques ◽

Clinical Settings ◽

Protein Markers ◽

Reduction Cell ◽

Machine Learning Methods ◽

Sample Classification ◽

Learning Techniques ◽

Machine Learning Applications

Modern cytometry technologies present opportunities to profile the immune system at a single-cell resolution with more than 50 protein markers, and have been widely used in both research and clinical settings. The number of publicly available cytometry datasets is growing. However, the analysis of cytometry data remains a bottleneck due to its high dimensionality, large cell numbers, and heterogeneity between datasets. Machine learning techniques are well suited to analyze complex cytometry data and have been used in multiple facets of cytometry data analysis, including dimensionality reduction, cell population identification, and sample classification. Here, we review the existing machine learning applications for analyzing cytometry data and highlight the importance of publicly available cytometry data that enable researchers to develop and validate machine learning methods.

Download Full-text

Predictive modeling for peri-implantitis by using machine learning techniques

Scientific Reports ◽

10.1038/s41598-021-90642-4 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Tomoaki Mameno ◽

Masahiro Wada ◽

Kazunori Nozaki ◽

Toshihito Takahashi ◽

Yoshitaka Tsujioka ◽

...

Keyword(s):

Machine Learning ◽

Demographic Data ◽

Risk Indicators ◽

Machine Learning Techniques ◽

Support Vector ◽

Machine Learning Methods ◽

Complex Interactions ◽

Learning Techniques ◽

Increased Risk ◽

Vector Machines

AbstractThe purpose of this retrospective cohort study was to create a model for predicting the onset of peri-implantitis by using machine learning methods and to clarify interactions between risk indicators. This study evaluated 254 implants, 127 with and 127 without peri-implantitis, from among 1408 implants with at least 4 years in function. Demographic data and parameters known to be risk factors for the development of peri-implantitis were analyzed with three models: logistic regression, support vector machines, and random forests (RF). As the results, RF had the highest performance in predicting the onset of peri-implantitis (AUC: 0.71, accuracy: 0.70, precision: 0.72, recall: 0.66, and f1-score: 0.69). The factor that had the most influence on prediction was implant functional time, followed by oral hygiene. In addition, PCR of more than 50% to 60%, smoking more than 3 cigarettes/day, KMW less than 2 mm, and the presence of less than two occlusal supports tended to be associated with an increased risk of peri-implantitis. Moreover, these risk indicators were not independent and had complex effects on each other. The results of this study suggest that peri-implantitis onset was predicted in 70% of cases, by RF which allows consideration of nonlinear relational data with complex interactions.

Download Full-text

The rise and fall of machine learning methods in biomedical research

F1000Research ◽

10.12688/f1000research.13016.1 ◽

2017 ◽

Vol 6 ◽

pp. 2012 ◽

Cited By ~ 6

Author(s):

Hashem Koohy

Keyword(s):

Machine Learning ◽

Biomedical Research ◽

Life Sciences ◽

Biological Data ◽

Research Note ◽

Machine Learning Techniques ◽

Learning Methods ◽

The Past ◽

Machine Learning Methods ◽

Learning Techniques

In the era of explosion in biological data, machine learning techniques are becoming more popular in life sciences, including biology and medicine. This research note examines the rise and fall of the most commonly used machine learning techniques in life sciences over the past three decades.

Download Full-text

Combining Correlation-Based Feature and Machine Learning for Sensory Evaluation of Saigon Beer

International Journal of Knowledge and Systems Science ◽

10.4018/ijkss.2020040104 ◽

2020 ◽

Vol 11 (2) ◽

pp. 71-85

Author(s):

Nhat-Vinh Lu ◽

Trong-Nhan Vuong ◽

Duy-Tai Dinh

Keyword(s):

Machine Learning ◽

Sensory Evaluation ◽

Machine Learning Techniques ◽

Support Vector ◽

Learning Methods ◽

Feature Selection Technique ◽

Machine Learning Methods ◽

Learning Techniques ◽

Correlation Based Feature Selection ◽

Positive Results

Sensory evaluation plays an important role in the food and consumer goods industry. In recent years, the application of machine learning techniques to support food sensory evaluation has become popular. Many different machine learning methods have been applied and produced positive results in this field. In this article, the authors propose a new method to support sensory evaluation on multiple criteria based on the use of a correlation-based feature selection technique, combined with machine learning methods such as linear regression, multilayer perceptron, support vector machine, and random forest. Experimental results are based on considering the correlation between physicochemical components and sensory factors on the Saigon beer dataset.

Download Full-text

Modern machine learning outperforms GLMs at predicting spikes

10.1101/111450 ◽

2017 ◽

Cited By ~ 4

Author(s):

Ari S. Benjamin ◽

Hugo L. Fernandes ◽

Tucker Tomlinson ◽

Pavan Ramkumar ◽

Chris VerSteeg ◽

...

Keyword(s):

Machine Learning ◽

Neural Activity ◽

Linear Models ◽

Feedforward Neural Networks ◽

Predictive Performance ◽

Machine Learning Techniques ◽

Machine Learning Methods ◽

Learning Techniques ◽

Neural Spiking ◽

Modern Machine

AbstractNeuroscience has long focused on finding encoding models that effectively ask “what predicts neural spiking?” and generalized linear models (GLMs) are a typical approach. It is often unknown how much of explainable neural activity is captured, or missed, when fitting a GLM. Here we compared the predictive performance of GLMs to three leading machine learning methods: feedforward neural networks, gradient boosted trees (using XGBoost), and stacked ensembles that combine the predictions of several methods. We predicted spike counts in macaque motor (M1) and somatosensory (S1) cortices from standard representations of reaching kinematics, and in rat hippocampal cells from open field location and orientation. In general, the modern methods (particularly XGBoost and the ensemble) produced more accurate spike predictions and were less sensitive to the preprocessing of features. This discrepancy in performance suggests that standard feature sets may often relate to neural activity in a nonlinear manner not captured by GLMs. Encoding models built with machine learning techniques, which can be largely automated, more accurately predict spikes and can offer meaningful benchmarks for simpler models.

Download Full-text

Machine Learning Applications on Agricultural Datasets for Smart Farm Enhancement

Machines ◽

10.3390/machines6030038 ◽

2018 ◽

Vol 6 (3) ◽

pp. 38 ◽

Cited By ~ 22

Author(s):

Fabrizio Balducci ◽

Donato Impedovo ◽

Giuseppe Pirlo

Keyword(s):

Machine Learning ◽

Machine Learning Techniques ◽

Agricultural Field ◽

Heterogeneous Information ◽

Learning Techniques ◽

Farm Model ◽

Machine Learning Applications ◽

Crop Harvest ◽

The Right ◽

The Internet Of Things

This work aims to show how to manage heterogeneous information and data coming from real datasets that collect physical, biological, and sensory values. As productive companies—public or private, large or small—need increasing profitability with costs reduction, discovering appropriate ways to exploit data that are continuously recorded and made available can be the right choice to achieve these goals. The agricultural field is only apparently refractory to the digital technology and the “smart farm” model is increasingly widespread by exploiting the Internet of Things (IoT) paradigm applied to environmental and historical information through time-series. The focus of this study is the design and deployment of practical tasks, ranging from crop harvest forecasting to missing or wrong sensors data reconstruction, exploiting and comparing various machine learning techniques to suggest toward which direction to employ efforts and investments. The results show how there are ample margins for innovation while supporting requests and needs coming from companies that wish to employ a sustainable and optimized agriculture industrial business, investing not only in technology, but also in the knowledge and in skilled workforce required to take the best out of it.

Download Full-text

Identification of Duplication in Questions Posed on Knowledge Sharing Platform Quora using Machine Learning Techniques

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.l3017.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 2444-2451

Keyword(s):

Machine Learning ◽

Question Answering ◽

Contextual Information ◽

Machine Learning Techniques ◽

Learning Methods ◽

Machine Learning Methods ◽

Comparison Methods ◽

Learning Techniques ◽

Letter Comparison ◽

Lower Accuracy

Quora, an online question-answering platform has a lot of duplicate questions i.e. questions that convey the same meaning. Since it is open to all users, anyone can pose a question any number of times this increases the count of duplicate questions. This paper uses a dataset comprising of question pairs (taken from the Quora website) in different columns with an indication of whether the pair of questions are duplicates or not. Traditional comparison methods like Sequence matcher perform a letter by letter comparison without understanding the contextual information, hence they give lower accuracy. Machine learning methods predict the similarity using features extracted from the context. Both the traditional methods as well as the machine learning methods were compared in this study. The features for the machine learning methods are extracted using the Bag of Words models- Count-Vectorizer and TFIDF-Vectorizer. Among the traditional comparison methods, Sequence matcher gave the highest accuracy of 65.29%. Among the machine learning methods XGBoost gave the highest accuracy, 80.89% when Count-Vectorizer is used and 80.12% when TFIDF-Vectorizer is used.

Download Full-text

Machine Learning:A Review

Semiconductor Science and Information Devices ◽

10.30564/ssid.v2i2.1931 ◽

2020 ◽

Vol 2 (2) ◽

Author(s):

Isonkobong Christopher Udousoro

Keyword(s):

Machine Learning ◽

Image Processing ◽

Data Interpretation ◽

Relevant Information ◽

Machine Learning Techniques ◽

Predictive Analysis ◽

Learning Approaches ◽

Processing Data ◽

Learning Techniques ◽

Machine Learning Applications

Due to the complexity of data, interpretation of pattern or extraction of information becomes difficult; therefore application of machine learning is used to teach machines how to handle data more efficiently. With the increase of datasets, various organizations now apply machine learning applications and algorithms. Many industries apply machine learning to extract relevant information for analysis purposes. Many scholars, mathematicians and programmers have carried out research and applied several machine learning approaches in order to find solution to problems. In this paper, we focus on general review of machine learning including various machine learning techniques. These techniques can be applied to different fields like image processing, data mining, predictive analysis and so on. The paper aims at reviewing machine learning techniques and algorithms. The research methodology is based on qualitative analysis where various literatures is being reviewed based on machine learning.

Download Full-text

A Comparative Analysis of Breast Cancer Detection and Diagnosis Using Data Visualization and Machine Learning Applications

Healthcare ◽

10.3390/healthcare8020111 ◽

2020 ◽

Vol 8 (2) ◽

pp. 111 ◽

Cited By ~ 1

Author(s):

Muhammet Fatih Ak

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Data Visualization ◽

Cancer Detection ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Machine Learning Applications ◽

Cancer Types ◽

Detection And Diagnosis ◽

Using Data

In the developing world, cancer death is one of the major problems for humankind. Even though there are many ways to prevent it before happening, some cancer types still do not have any treatment. One of the most common cancer types is breast cancer, and early diagnosis is the most important thing in its treatment. Accurate diagnosis is one of the most important processes in breast cancer treatment. In the literature, there are many studies about predicting the type of breast tumors. In this research paper, data about breast cancer tumors from Dr. William H. Walberg of the University of Wisconsin Hospital were used for making predictions on breast tumor types. Data visualization and machine learning techniques including logistic regression, k-nearest neighbors, support vector machine, naïve Bayes, decision tree, random forest, and rotation forest were applied to this dataset. R, Minitab, and Python were chosen to be applied to these machine learning techniques and visualization. The paper aimed to make a comparative analysis using data visualization and machine learning applications for breast cancer detection and diagnosis. Diagnostic performances of applications were comparable for detecting breast cancers. Data visualization and machine learning techniques can provide significant benefits and impact cancer detection in the decision-making process. In this paper, different machine learning and data mining techniques for the detection of breast cancer were proposed. Results obtained with the logistic regression model with all features included showed the highest classification accuracy (98.1%), and the proposed approach revealed the enhancement in accuracy performances. These results indicated the potential to open new opportunities in the detection of breast cancer.

Download Full-text

A pilot study of a machine-learning tool to assist in the diagnosis of hand arthritis (Preprint)

10.2196/preprints.20354 ◽

2020 ◽

Author(s):

Mark Daly Reed ◽

Timothy James Le Souef ◽

Elliot Rampono

Keyword(s):

Machine Learning ◽

Inflammatory Arthritis ◽

Machine Learning Techniques ◽

Patient Questionnaire ◽

Machine Learning Methods ◽

Applied Machine Learning ◽

Learning Techniques ◽

Examination Result ◽

Delays In Diagnosis ◽

Rheumatology Practice

BACKGROUND Arthritis is a common condition, which frequently involves the hands. Patients with inflammatory arthritis have been shown to experience significant delays in diagnosis. OBJECTIVE We sought to develop and test a screening tool combining an image of a patient’s hands, a short series of questions, and a single examination technique, to determine the most likely diagnosis in a patient presenting with hand arthritis. Machine learning techniques were used to develop separate algorithms for each component, which were combined to produce a diagnosis. METHODS 280 consecutive new patients presenting to a Rheumatology practice with hand arthritis were enrolled. Each patient completed a 9-part questionnaire, had photographs taken of each hand, and had a single examination result recorded. The Rheumatologist diagnosis was recorded following a 45-minute consultation. The photograph algorithm was developed from a library of 1000 images, and machine learning techniques were applied to the questionnaire results, training several models against the diagnosis from the Rheumatologist. RESULTS The combined algorithms in this study were able to predict inﬂammatory arthritis with an accuracy, precision, recall and specificity of 96·8%, 97·2%, 98·6% and 90·5% respectively. Similar results were found when inflammatory arthritis was subclassified into rheumatoid arthritis and psoriatic arthritis. The corresponding figures for osteoarthritis were 79·6%, 85·9%, 61·9% and 92·6%. CONCLUSIONS This study demonstrates a novel application of a combined image-processing and a patient questionnaire with applied machine-learning methods, to facilitate the diagnosis of patients presenting with hand arthritis. Preliminary results are encouraging for the application of such techniques in clinical practice. CLINICALTRIAL Not applicable.

Download Full-text

Business Processes, Dynamic Contexts, Learning

Encyclopedia of Business Analytics and Optimization ◽

10.4018/978-1-4666-5202-6.ch037 ◽

2014 ◽

pp. 407-417

Author(s):

Michael M. Richter

Keyword(s):

Machine Learning ◽

Business Processes ◽

Machine Learning Techniques ◽

Underlying Structure ◽

Learning Methods ◽

Machine Learning Methods ◽

Open World ◽

Learning Techniques

In this article we present relations between complex business processes and machine learning techniques. The processes considered here are mostly related to planning. Planning takes place in preparing many decisions and often it is encountered with a rapidly changing context that constitutes an open world. The underlying structure and preconditions of the processes is quite often not known and hence the processes are regarded as stochastic. One can only observe the processes. Such observations deliver data and these data contain some knowledge about the processes in a hidden form. As a consequence, machine learning methods are involved here. The idea is to give the business persons an overview of quite different machine learning techniques so that they can select suitable ones. We provide a number of examples for business processes that we use for illustrations.

Download Full-text