scholarly journals A Tree Based Machine Learning Approach for PTB Diagnostic Dataset

2021 ◽  
Vol 2115 (1) ◽  
pp. 012042
Author(s):  
S Premanand ◽  
Sathiya Narayanan

Abstract The primary objective of this particular paper is to classify the health-related data without feature extraction in Machine Learning, which hinder the performance and reliability. The assumption of our work will be like, can we able to get better result for health-related data with the help of Tree based Machine Learning algorithms without extracting features like in Deep Learning. This study performs better classification with Tree based Machine Learning approach for the health-related medical data. After doing pre-processing, without feature extraction, i.e., from raw data signal with the help of Machine Learning algorithms we are able to get better results. The presented paper which has better result even when compared to some of the advanced Deep Learning architecture models. The results demonstrate that overall classification accuracy of Random Forest, XGBoost, LightGBM and CatBoost, Tree-based Machine Learning algorithms for normal and abnormal condition of the datasets was found to be 97.88%, 98.23%, 98.03% and 95.57% respectively.

Author(s):  
Qianfan Wu ◽  
Adel Boueiz ◽  
Alican Bozkurt ◽  
Arya Masoomi ◽  
Allan Wang ◽  
...  

Predicting disease status for a complex human disease using genomic data is an important, yet challenging, step in personalized medicine. Among many challenges, the so-called curse of dimensionality problem results in unsatisfied performances of many state-of-art machine learning algorithms. A major recent advance in machine learning is the rapid development of deep learning algorithms that can efficiently extract meaningful features from high-dimensional and complex datasets through a stacked and hierarchical learning process. Deep learning has shown breakthrough performance in several areas including image recognition, natural language processing, and speech recognition. However, the performance of deep learning in predicting disease status using genomic datasets is still not well studied. In this article, we performed a review on the four relevant articles that we found through our thorough literature review. All four articles used auto-encoders to project high-dimensional genomic data to a low dimensional space and then applied the state-of-the-art machine learning algorithms to predict disease status based on the low-dimensional representations. This deep learning approach outperformed existing prediction approaches, such as prediction based on probe-wise screening and prediction based on principal component analysis. The limitations of the current deep learning approach and possible improvements were also discussed.


Author(s):  
Marco A. Alvarez ◽  
SeungJin Lim

Current search engines impose an overhead to motivated students and Internet users who employ the Web as a valuable resource for education. The user, searching for good educational materials for a technical subject, often spends extra time to filter irrelevant pages or ends up with commercial advertisements. It would be ideal if, given a technical subject by user who is educationally motivated, suitable materials with respect to the given subject are automatically identified by an affordable machine processing of the recommendation set returned by a search engine for the subject. In this scenario, the user can save a significant amount of time in filtering out less useful Web pages, and subsequently the user’s learning goal on the subject can be achieved more efficiently without clicking through numerous pages. This type of convenient learning is called One-Stop Learning (OSL). In this paper, the contributions made by Lim and Ko in (Lim and Ko, 2006) for OSL are redefined and modeled using machine learning algorithms. Four selected supervised learning algorithms: Support Vector Machine (SVM), AdaBoost, Naive Bayes and Neural Networks are evaluated using the same data used in (Lim and Ko, 2006). The results presented in this paper are promising, where the highest precision (98.9%) and overall accuracy (96.7%) obtained by using SVM is superior to the results presented by Lim and Ko. Furthermore, the machine learning approach presented here, demonstrates that the small set of features used to represent each Web page yields a good solution for the OSL problem.


2021 ◽  
Vol 2021 ◽  
pp. 1-15
Author(s):  
Absalom E. Ezugwu ◽  
Ibrahim Abaker Targio Hashem ◽  
Olaide N. Oyelade ◽  
Mubarak Almutari ◽  
Mohammed A. Al-Garadi ◽  
...  

The spread of COVID-19 worldwide continues despite multidimensional efforts to curtail its spread and provide treatment. Efforts to contain the COVID-19 pandemic have triggered partial or full lockdowns across the globe. This paper presents a novel framework that intelligently combines machine learning models and the Internet of Things (IoT) technology specifically to combat COVID-19 in smart cities. The purpose of the study is to promote the interoperability of machine learning algorithms with IoT technology by interacting with a population and its environment to curtail the COVID-19 pandemic. Furthermore, the study also investigates and discusses some solution frameworks, which can generate, capture, store, and analyze data using machine learning algorithms. These algorithms can detect, prevent, and trace the spread of COVID-19 and provide a better understanding of the disease in smart cities. Similarly, the study outlined case studies on the application of machine learning to help fight against COVID-19 in hospitals worldwide. The framework proposed in the study is a comprehensive presentation on the major components needed to integrate the machine learning approach with other AI-based solutions. Finally, the machine learning framework presented in this study has the potential to help national healthcare systems in curtailing the COVID-19 pandemic in smart cities. In addition, the proposed framework is poised as a pointer for generating research interests that would yield outcomes capable of been integrated to form an improved framework.


Risks ◽  
2021 ◽  
Vol 9 (3) ◽  
pp. 50
Author(s):  
Apostolos Ampountolas ◽  
Titus Nyarko Nde ◽  
Paresh Date ◽  
Corina Constantinescu

In micro-lending markets, lack of recorded credit history is a significant impediment to assessing individual borrowers’ creditworthiness and therefore deciding fair interest rates. This research compares various machine learning algorithms on real micro-lending data to test their efficacy at classifying borrowers into various credit categories. We demonstrate that off-the-shelf multi-class classifiers such as random forest algorithms can perform this task very well, using readily available data about customers (such as age, occupation, and location). This presents inexpensive and reliable means to micro-lending institutions around the developing world with which to assess creditworthiness in the absence of credit history or central credit databases.


2020 ◽  
Vol 10 (6) ◽  
pp. 6589-6596
Author(s):  
H. Al-Dossari ◽  
F. A. Nughaymish ◽  
Z. Al-Qahtani ◽  
M. Alkahlifah ◽  
A. Alqahtani

Enterprises rely more and more on well-qualified and highly specialized IT professionals. Although the increasing availability of IT jobs is a good indicator for IT graduates, they nonetheless may find themselves confused about the most appropriate career for their future. In this paper, a recommendation system called CareerRec is proposed, which uses machine learning algorithms to help IT graduates select a career path based on their skills. CareerRec was trained and tested using a dataset of 2255 employees in the IT sector in Saudi Arabia. We conducted a performance comparison between five machine learning algorithms to assess their accuracy for predicting the best-suited career path among 3 classes. Our experiments demonstrate that the XGBoost algorithm outperforms other models and gives the highest accuracy (70.47%).


2021 ◽  
Vol 12 (3) ◽  
pp. 1550-1556
Author(s):  
Ravi Kumar Y B Et.al

The current research work encompasses the assessment of similarity based facial features of images with erected method so as to determines the genealogical similarity. It is based on the principle of grouping the closer features, as compared to those which are away from the predefined threshold for a better ascertainment of the extracted features. The system developed is trained using deep learning-oriented architecture incorporating these closer features for a binary classification of the subjects considered into genealogic non-genealogic. The genealogic set of data is further used to calculate the percentage of similarity with erected methods. The present work considered XX datasets from XXXX source for the assessment of facial similarities. The results portrayed an accuracy of 96.3% for genealogic data, the salient among them being those of father-daughter (98.1%), father-son(98.3%), mother-daughter(96.6%), mother-son(96.1%) genealogy in case of the datasets from “kinface W-I”. Extending this work onto “kinface W-II” set of data, the results were promising with father-daughter(98.5%), father-son(96.7%), mother-daughter(93.4%) and mother-son(98.9%) genealogy. Such an approach could be further extended to larger database so as to assess the genealogical similarity with the aid of machine-learning algorithms.


2018 ◽  
Author(s):  
Qianfan Wu ◽  
Adel Boueiz ◽  
Alican Bozkurt ◽  
Arya Masoomi ◽  
Allan Wang ◽  
...  

Predicting disease status for a complex human disease using genomic data is an important, yet challenging, step in personalized medicine. Among many challenges, the so-called curse of dimensionality problem results in unsatisfied performances of many state-of-art machine learning algorithms. A major recent advance in machine learning is the rapid development of deep learning algorithms that can efficiently extract meaningful features from high-dimensional and complex datasets through a stacked and hierarchical learning process. Deep learning has shown breakthrough performance in several areas including image recognition, natural language processing, and speech recognition. However, the performance of deep learning in predicting disease status using genomic datasets is still not well studied. In this article, we performed a review on the four relevant articles that we found through our thorough literature review. All four articles used auto-encoders to project high-dimensional genomic data to a low dimensional space and then applied the state-of-the-art machine learning algorithms to predict disease status based on the low-dimensional representations. This deep learning approach outperformed existing prediction approaches, such as prediction based on probe-wise screening and prediction based on principal component analysis. The limitations of the current deep learning approach and possible improvements were also discussed.


2020 ◽  
Vol 9 (5) ◽  
pp. 2090-2096
Author(s):  
Hana’ Abd Razak ◽  
M. Ahmed M. Saleh ◽  
Nooritawati Md Tahir

A review on anomalous behavior in crime by other researchers is discussed in this study that focused specifically on the linkage between anomalous behaviors. Next, comprehensive reviews related to gait recognition in utilizing machine learning algorithms for detection and recognition of anomalous behavior is elaborated too. The review begins with the conventional approach of gait recognition that includes feature extraction and classification using PCA, OLS, ANN, and SVM. Further, the review focused on utilization of deep learning namely CNN for anomalous gait behavior detection and transfer learning using pre-trained CNNs such as AlexNet, VGG, and a few more. To the extent of our knowledge, very few studies investigated and explored crime related anomalous behavior based on their gaits, hence this will be the next study that we will explore.


2020 ◽  
Vol 6 (2) ◽  
pp. 107-114
Author(s):  
Tinir Mohamed Sadi ◽  
Raini Hassan

The most common method used by physicians and pulmonologists to evaluate the state of the lung is by listening to the acoustics of the patient's breathing by a stethoscope. Misdiagnosis and eventually, mistreatment are rampant if auscultation is not done properly. There have been efforts to address this problem using a myriad of machine learning algorithms, but little has been done using deep learning. A CNN model with MFCC is expected to mitigate these problems. The problem has been in the paucity of large enough datasets. Results show 0.76 and 0.60 for recall for wheeze and crackle respectively, these number are set to increase with optimization.


Landslides can easily be tragic to human life and property. Increase in the rate of human settlement in the mountains has resulted in safety concerns. Landslides have caused economic loss between 1-2% of the GDP in many developing countries. In this study, we discuss a deep learning approach to detect landslides. Convolutional Neural Networks are used for feature extraction for our proposed model. As there was no source of an exact and precise data set for feature extraction, therefore, a new data set was built for testing the model. We have tested and compared this work with our proposed model and with other machine-learning algorithms such as Logistic Regression, Random Forest, AdaBoost, K-Nearest Neighbors and Support Vector Machine. Our proposed deep learning model produces a classification accuracy of 96.90% outperforming the classical machine-learning algorithms.


Sign in / Sign up

Export Citation Format

Share Document