scholarly journals Application of Machine Learning for Cytometry Data

2022 ◽  
Vol 12 ◽  
Author(s):  
Zicheng Hu ◽  
Sanchita Bhattacharya ◽  
Atul J. Butte

Modern cytometry technologies present opportunities to profile the immune system at a single-cell resolution with more than 50 protein markers, and have been widely used in both research and clinical settings. The number of publicly available cytometry datasets is growing. However, the analysis of cytometry data remains a bottleneck due to its high dimensionality, large cell numbers, and heterogeneity between datasets. Machine learning techniques are well suited to analyze complex cytometry data and have been used in multiple facets of cytometry data analysis, including dimensionality reduction, cell population identification, and sample classification. Here, we review the existing machine learning applications for analyzing cytometry data and highlight the importance of publicly available cytometry data that enable researchers to develop and validate machine learning methods.

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Tomoaki Mameno ◽  
Masahiro Wada ◽  
Kazunori Nozaki ◽  
Toshihito Takahashi ◽  
Yoshitaka Tsujioka ◽  
...  

AbstractThe purpose of this retrospective cohort study was to create a model for predicting the onset of peri-implantitis by using machine learning methods and to clarify interactions between risk indicators. This study evaluated 254 implants, 127 with and 127 without peri-implantitis, from among 1408 implants with at least 4 years in function. Demographic data and parameters known to be risk factors for the development of peri-implantitis were analyzed with three models: logistic regression, support vector machines, and random forests (RF). As the results, RF had the highest performance in predicting the onset of peri-implantitis (AUC: 0.71, accuracy: 0.70, precision: 0.72, recall: 0.66, and f1-score: 0.69). The factor that had the most influence on prediction was implant functional time, followed by oral hygiene. In addition, PCR of more than 50% to 60%, smoking more than 3 cigarettes/day, KMW less than 2 mm, and the presence of less than two occlusal supports tended to be associated with an increased risk of peri-implantitis. Moreover, these risk indicators were not independent and had complex effects on each other. The results of this study suggest that peri-implantitis onset was predicted in 70% of cases, by RF which allows consideration of nonlinear relational data with complex interactions.


F1000Research ◽  
2017 ◽  
Vol 6 ◽  
pp. 2012 ◽  
Author(s):  
Hashem Koohy

In the era of explosion in biological data, machine learning techniques are becoming more popular in life sciences, including biology and medicine. This research note examines the rise and fall of the most commonly used machine learning techniques in life sciences over the past three decades.


2020 ◽  
Vol 11 (2) ◽  
pp. 71-85
Author(s):  
Nhat-Vinh Lu ◽  
Trong-Nhan Vuong ◽  
Duy-Tai Dinh

Sensory evaluation plays an important role in the food and consumer goods industry. In recent years, the application of machine learning techniques to support food sensory evaluation has become popular. Many different machine learning methods have been applied and produced positive results in this field. In this article, the authors propose a new method to support sensory evaluation on multiple criteria based on the use of a correlation-based feature selection technique, combined with machine learning methods such as linear regression, multilayer perceptron, support vector machine, and random forest. Experimental results are based on considering the correlation between physicochemical components and sensory factors on the Saigon beer dataset.


2017 ◽  
Author(s):  
Ari S. Benjamin ◽  
Hugo L. Fernandes ◽  
Tucker Tomlinson ◽  
Pavan Ramkumar ◽  
Chris VerSteeg ◽  
...  

AbstractNeuroscience has long focused on finding encoding models that effectively ask “what predicts neural spiking?” and generalized linear models (GLMs) are a typical approach. It is often unknown how much of explainable neural activity is captured, or missed, when fitting a GLM. Here we compared the predictive performance of GLMs to three leading machine learning methods: feedforward neural networks, gradient boosted trees (using XGBoost), and stacked ensembles that combine the predictions of several methods. We predicted spike counts in macaque motor (M1) and somatosensory (S1) cortices from standard representations of reaching kinematics, and in rat hippocampal cells from open field location and orientation. In general, the modern methods (particularly XGBoost and the ensemble) produced more accurate spike predictions and were less sensitive to the preprocessing of features. This discrepancy in performance suggests that standard feature sets may often relate to neural activity in a nonlinear manner not captured by GLMs. Encoding models built with machine learning techniques, which can be largely automated, more accurately predict spikes and can offer meaningful benchmarks for simpler models.


Machines ◽  
2018 ◽  
Vol 6 (3) ◽  
pp. 38 ◽  
Author(s):  
Fabrizio Balducci ◽  
Donato Impedovo ◽  
Giuseppe Pirlo

This work aims to show how to manage heterogeneous information and data coming from real datasets that collect physical, biological, and sensory values. As productive companies—public or private, large or small—need increasing profitability with costs reduction, discovering appropriate ways to exploit data that are continuously recorded and made available can be the right choice to achieve these goals. The agricultural field is only apparently refractory to the digital technology and the “smart farm” model is increasingly widespread by exploiting the Internet of Things (IoT) paradigm applied to environmental and historical information through time-series. The focus of this study is the design and deployment of practical tasks, ranging from crop harvest forecasting to missing or wrong sensors data reconstruction, exploiting and comparing various machine learning techniques to suggest toward which direction to employ efforts and investments. The results show how there are ample margins for innovation while supporting requests and needs coming from companies that wish to employ a sustainable and optimized agriculture industrial business, investing not only in technology, but also in the knowledge and in skilled workforce required to take the best out of it.


Quora, an online question-answering platform has a lot of duplicate questions i.e. questions that convey the same meaning. Since it is open to all users, anyone can pose a question any number of times this increases the count of duplicate questions. This paper uses a dataset comprising of question pairs (taken from the Quora website) in different columns with an indication of whether the pair of questions are duplicates or not. Traditional comparison methods like Sequence matcher perform a letter by letter comparison without understanding the contextual information, hence they give lower accuracy. Machine learning methods predict the similarity using features extracted from the context. Both the traditional methods as well as the machine learning methods were compared in this study. The features for the machine learning methods are extracted using the Bag of Words models- Count-Vectorizer and TFIDF-Vectorizer. Among the traditional comparison methods, Sequence matcher gave the highest accuracy of 65.29%. Among the machine learning methods XGBoost gave the highest accuracy, 80.89% when Count-Vectorizer is used and 80.12% when TFIDF-Vectorizer is used.


2020 ◽  
Vol 2 (2) ◽  
Author(s):  
Isonkobong Christopher Udousoro

Due to the complexity of data, interpretation of pattern or extraction of information becomes difficult; therefore application of machine learning is used to teach machines how to handle data more efficiently. With the increase of datasets, various organizations now apply machine learning applications and algorithms. Many industries apply machine learning to extract relevant information for analysis purposes. Many scholars, mathematicians and programmers have carried out research and applied several machine learning approaches in order to find solution to problems. In this paper, we focus on general review of machine learning including various machine learning techniques. These techniques can be applied to different fields like image processing, data mining, predictive analysis and so on. The paper aims at reviewing machine learning techniques and algorithms. The research methodology is based on qualitative analysis where various literatures is being reviewed based  on machine learning.


Healthcare ◽  
2020 ◽  
Vol 8 (2) ◽  
pp. 111 ◽  
Author(s):  
Muhammet Fatih Ak

In the developing world, cancer death is one of the major problems for humankind. Even though there are many ways to prevent it before happening, some cancer types still do not have any treatment. One of the most common cancer types is breast cancer, and early diagnosis is the most important thing in its treatment. Accurate diagnosis is one of the most important processes in breast cancer treatment. In the literature, there are many studies about predicting the type of breast tumors. In this research paper, data about breast cancer tumors from Dr. William H. Walberg of the University of Wisconsin Hospital were used for making predictions on breast tumor types. Data visualization and machine learning techniques including logistic regression, k-nearest neighbors, support vector machine, naïve Bayes, decision tree, random forest, and rotation forest were applied to this dataset. R, Minitab, and Python were chosen to be applied to these machine learning techniques and visualization. The paper aimed to make a comparative analysis using data visualization and machine learning applications for breast cancer detection and diagnosis. Diagnostic performances of applications were comparable for detecting breast cancers. Data visualization and machine learning techniques can provide significant benefits and impact cancer detection in the decision-making process. In this paper, different machine learning and data mining techniques for the detection of breast cancer were proposed. Results obtained with the logistic regression model with all features included showed the highest classification accuracy (98.1%), and the proposed approach revealed the enhancement in accuracy performances. These results indicated the potential to open new opportunities in the detection of breast cancer.


2020 ◽  
Author(s):  
Mark Daly Reed ◽  
Timothy James Le Souef ◽  
Elliot Rampono

BACKGROUND Arthritis is a common condition, which frequently involves the hands. Patients with inflammatory arthritis have been shown to experience significant delays in diagnosis. OBJECTIVE We sought to develop and test a screening tool combining an image of a patient’s hands, a short series of questions, and a single examination technique, to determine the most likely diagnosis in a patient presenting with hand arthritis. Machine learning techniques were used to develop separate algorithms for each component, which were combined to produce a diagnosis. METHODS 280 consecutive new patients presenting to a Rheumatology practice with hand arthritis were enrolled. Each patient completed a 9-part questionnaire, had photographs taken of each hand, and had a single examination result recorded. The Rheumatologist diagnosis was recorded following a 45-minute consultation. The photograph algorithm was developed from a library of 1000 images, and machine learning techniques were applied to the questionnaire results, training several models against the diagnosis from the Rheumatologist. RESULTS The combined algorithms in this study were able to predict inflammatory arthritis with an accuracy, precision, recall and specificity of 96·8%, 97·2%, 98·6% and 90·5% respectively. Similar results were found when inflammatory arthritis was subclassified into rheumatoid arthritis and psoriatic arthritis. The corresponding figures for osteoarthritis were 79·6%, 85·9%, 61·9% and 92·6%. CONCLUSIONS This study demonstrates a novel application of a combined image-processing and a patient questionnaire with applied machine-learning methods, to facilitate the diagnosis of patients presenting with hand arthritis. Preliminary results are encouraging for the application of such techniques in clinical practice. CLINICALTRIAL Not applicable.


Author(s):  
Michael M. Richter

In this article we present relations between complex business processes and machine learning techniques. The processes considered here are mostly related to planning. Planning takes place in preparing many decisions and often it is encountered with a rapidly changing context that constitutes an open world. The underlying structure and preconditions of the processes is quite often not known and hence the processes are regarded as stochastic. One can only observe the processes. Such observations deliver data and these data contain some knowledge about the processes in a hidden form. As a consequence, machine learning methods are involved here. The idea is to give the business persons an overview of quite different machine learning techniques so that they can select suitable ones. We provide a number of examples for business processes that we use for illustrations.


Sign in / Sign up

Export Citation Format

Share Document