scholarly journals Machine Learning Models for Automatic Labeling: A Systematic Literature Review

Author(s):  
Teodor Fredriksson ◽  
Jan Bosch ◽  
Helena Olsson
2021 ◽  
Vol 21 (2) ◽  
pp. 1-31
Author(s):  
Bjarne Pfitzner ◽  
Nico Steckhan ◽  
Bert Arnrich

Data privacy is a very important issue. Especially in fields like medicine, it is paramount to abide by the existing privacy regulations to preserve patients’ anonymity. However, data is required for research and training machine learning models that could help gain insight into complex correlations or personalised treatments that may otherwise stay undiscovered. Those models generally scale with the amount of data available, but the current situation often prohibits building large databases across sites. So it would be beneficial to be able to combine similar or related data from different sites all over the world while still preserving data privacy. Federated learning has been proposed as a solution for this, because it relies on the sharing of machine learning models, instead of the raw data itself. That means private data never leaves the site or device it was collected on. Federated learning is an emerging research area, and many domains have been identified for the application of those methods. This systematic literature review provides an extensive look at the concept of and research into federated learning and its applicability for confidential healthcare datasets.


2021 ◽  
Author(s):  
Eren Asena ◽  
Henk Cremers

Introduction. Biological psychiatry has yet to find clinically useful biomarkers despite mucheffort. Is this because the field needs better methods and more data, or are current conceptualizations of mental disorders too reductionistic? Although this is an important question, there seems to be no consensus on what it means to be a “reductionist”. Aims. This paper aims to; a) to clarify the views of researchers on different types of reductionism; b) to examine the relationship between these views and the degree to which researchers believe mental disorders can be predicted from biomarkers; c) to compare these predictability estimates with the performance of machine learning models that have used biomarkers to distinguish cases from controls. Methods. We created a survey on reductionism and the predictability of mental disorders from biomarkers, and shared it with researchers in biological psychiatry. Furthermore, a literature review was conducted on the performance of machine learning models in predicting mental disorders from biomarkers. Results. The survey results showed that 9% of the sample were dualists and 57% were explanatory reductionists. There was no relationship between reductionism and perceived predictability. The estimated predictability of 11 mental disorders using currently available methods ranged between 65-80%, which was comparable to the results from the literature review. However, the participants were highly optimistic about the ability of future methods in distinguishing cases from controls. Moreover, although behavioral data were rated as the most effective data type in predicting mental disorders, the participants expected biomarkers to play a significant role in not just predicting, but also defining mental disorders in the future.


2021 ◽  

Background: Nowadays, it can be seen that changes have taken place in the process of diseases and their clinical parameters. Accordingly, in some cases, general medical science and the use of clinical statistics based on the experiences of the physicians are not enough for the provision of sufficient tools for an early and accurate diagnosis. Therefore, medical science increasingly seeks to use unconventional methods and machine learning techniques. The issue of diagnosis in the medical world and the error rate of physicians in this regard are among the main challenges of the condition of patients and diseases. For this reason, in recent years, artificial intelligence tools have been used to help physicians. However, one of the main problems is that the effectiveness of machine learning tools is not studied much. Due to the sensitivity and high prevalence of diseases, especially gastrointestinal cancer, there is a need for a systematic review to identify methods of machine learning and artificial intelligence and compare their impact on the diagnosis of lower gastrointestinal cancers. Objectives: This systematic review aimed to identify the machine learning methods used for the diagnosis of lower gastrointestinal cancers. Moreover, it aimed to classify the presented methods and compare their effectiveness and evaluation indicators. Methods: This systematic review was conducted using six databases. The systematic literature review follows the Preferred Reporting Items for Systematic Reviews and Meta-Analyses statement for systematic reviews. The search strategy consisted of four expressions, namely “machine learning algorithm”, “lower gastrointestinal”, “cancer”, and “diagnosis and screening”, in that order. It should be mentioned that studies based on treatment were excluded from this review. Similarly, studies that presented guidelines, protocols, and instructions were excluded since they only require the focus of clinicians and do not provide progression along an active chain of reasoning. Finally, studies were excluded if they had not undergone a peer-review process. The following aspects were extracted from each article: authors, year, country, machine learning model and algorithm, sample size, the type of data, and the results of the model. The selected studies were classified based on three criteria: 1) machine learning model, 2) cancer type, and 3) effect of machine learning on cancer diagnosis. Results: In total, 44 studies were included in this systematic literature review. The earliest article was published in 2010, and the most recent was from 2019. Among the studies reviewed in this systematic review, one study was performed on the rectum (rectal cancer), one was about the small bowel (small bowel cancer), and 42 studies were on the colon (colon cancer, colorectal cancer, and colonic polyps). In total, 19 out of the 44 (43%) articles from the systematic literature review presented a deep learning model, and 25 (57%) articles used classic machine learning. The models worked mostly on image and all of them were supervised learning models. All studies with deep learning models used Convolutional Neural Network and were published between 2016 and 2019. The studies with classic machine learning models used diverse methods, mostly Support Vector Machine, K-Nearest Neighbors, and Artificial Neural Network. Conclusion: Machine learning methods are suitable tools in the field of cancer diagnosis, especially in cases related to the lower gastrointestinal tract. These methods can not only increase the accuracy of diagnosis and help the doctor to make the right decision, but also help in the early diagnosis of cancer and reduce treatment costs. The methods presented so far have focused more on image data and more than anything else have helped to increase the accuracy of physicians in making the correct diagnosis. Achievement of the right method for early diagnosis requires more accurate data sets and analyses.


2020 ◽  
Vol 2 (1) ◽  
pp. 3-6
Author(s):  
Eric Holloway

Imagination Sampling is the usage of a person as an oracle for generating or improving machine learning models. Previous work demonstrated a general system for using Imagination Sampling for obtaining multibox models. Here, the possibility of importing such models as the starting point for further automatic enhancement is explored.


2021 ◽  
Author(s):  
Norberto Sánchez-Cruz ◽  
Jose L. Medina-Franco

<p>Epigenetic targets are a significant focus for drug discovery research, as demonstrated by the eight approved epigenetic drugs for treatment of cancer and the increasing availability of chemogenomic data related to epigenetics. This data represents a large amount of structure-activity relationships that has not been exploited thus far for the development of predictive models to support medicinal chemistry efforts. Herein, we report the first large-scale study of 26318 compounds with a quantitative measure of biological activity for 55 protein targets with epigenetic activity. Through a systematic comparison of machine learning models trained on molecular fingerprints of different design, we built predictive models with high accuracy for the epigenetic target profiling of small molecules. The models were thoroughly validated showing mean precisions up to 0.952 for the epigenetic target prediction task. Our results indicate that the herein reported models have considerable potential to identify small molecules with epigenetic activity. Therefore, our results were implemented as freely accessible and easy-to-use web application.</p>


2020 ◽  
Author(s):  
Shreya Reddy ◽  
Lisa Ewen ◽  
Pankti Patel ◽  
Prerak Patel ◽  
Ankit Kundal ◽  
...  

<p>As bots become more prevalent and smarter in the modern age of the internet, it becomes ever more important that they be identified and removed. Recent research has dictated that machine learning methods are accurate and the gold standard of bot identification on social media. Unfortunately, machine learning models do not come without their negative aspects such as lengthy training times, difficult feature selection, and overwhelming pre-processing tasks. To overcome these difficulties, we are proposing a blockchain framework for bot identification. At the current time, it is unknown how this method will perform, but it serves to prove the existence of an overwhelming gap of research under this area.<i></i></p>


Sign in / Sign up

Export Citation Format

Share Document