A Tree Based Machine Learning Approach for PTB Diagnostic Dataset

Predicting disease status for a complex human disease using genomic data is an important, yet challenging, step in personalized medicine. Among many challenges, the so-called curse of dimensionality problem results in unsatisfied performances of many state-of-art machine learning algorithms. A major recent advance in machine learning is the rapid development of deep learning algorithms that can efficiently extract meaningful features from high-dimensional and complex datasets through a stacked and hierarchical learning process. Deep learning has shown breakthrough performance in several areas including image recognition, natural language processing, and speech recognition. However, the performance of deep learning in predicting disease status using genomic datasets is still not well studied. In this article, we performed a review on the four relevant articles that we found through our thorough literature review. All four articles used auto-encoders to project high-dimensional genomic data to a low dimensional space and then applied the state-of-the-art machine learning algorithms to predict disease status based on the low-dimensional representations. This deep learning approach outperformed existing prediction approaches, such as prediction based on probe-wise screening and prediction based on principal component analysis. The limitations of the current deep learning approach and possible improvements were also discussed.

Download Full-text

A Machine Learning Approach for One-Stop Learning

Data Mining and Knowledge Discovery Technologies ◽

10.4018/978-1-59904-960-1.ch013 ◽

2008 ◽

pp. 333-357 ◽

Cited By ~ 1

Author(s):

Marco A. Alvarez ◽

SeungJin Lim

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Support Vector ◽

Learning Approach ◽

Internet Users ◽

Machine Learning Approach ◽

Supervised Learning Algorithms ◽

One Stop ◽

The Subject

Current search engines impose an overhead to motivated students and Internet users who employ the Web as a valuable resource for education. The user, searching for good educational materials for a technical subject, often spends extra time to filter irrelevant pages or ends up with commercial advertisements. It would be ideal if, given a technical subject by user who is educationally motivated, suitable materials with respect to the given subject are automatically identified by an affordable machine processing of the recommendation set returned by a search engine for the subject. In this scenario, the user can save a significant amount of time in filtering out less useful Web pages, and subsequently the user’s learning goal on the subject can be achieved more efficiently without clicking through numerous pages. This type of convenient learning is called One-Stop Learning (OSL). In this paper, the contributions made by Lim and Ko in (Lim and Ko, 2006) for OSL are redefined and modeled using machine learning algorithms. Four selected supervised learning algorithms: Support Vector Machine (SVM), AdaBoost, Naive Bayes and Neural Networks are evaluated using the same data used in (Lim and Ko, 2006). The results presented in this paper are promising, where the highest precision (98.9%) and overall accuracy (96.7%) obtained by using SVM is superior to the results presented by Lim and Ko. Furthermore, the machine learning approach presented here, demonstrates that the small set of features used to represent each Web page yields a good solution for the OSL problem.

Download Full-text

A Novel Smart City-Based Framework on Perspectives for Application of Machine Learning in Combating COVID-19

BioMed Research International ◽

10.1155/2021/5546790 ◽

2021 ◽

Vol 2021 ◽

pp. 1-15

Author(s):

Absalom E. Ezugwu ◽

Ibrahim Abaker Targio Hashem ◽

Olaide N. Oyelade ◽

Mubarak Almutari ◽

Mohammed A. Al-Garadi ◽

...

Keyword(s):

Machine Learning ◽

Smart Cities ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Learning Approach ◽

Learning Framework ◽

National Healthcare ◽

Machine Learning Approach ◽

The Internet Of Things ◽

Analyze Data

The spread of COVID-19 worldwide continues despite multidimensional efforts to curtail its spread and provide treatment. Efforts to contain the COVID-19 pandemic have triggered partial or full lockdowns across the globe. This paper presents a novel framework that intelligently combines machine learning models and the Internet of Things (IoT) technology specifically to combat COVID-19 in smart cities. The purpose of the study is to promote the interoperability of machine learning algorithms with IoT technology by interacting with a population and its environment to curtail the COVID-19 pandemic. Furthermore, the study also investigates and discusses some solution frameworks, which can generate, capture, store, and analyze data using machine learning algorithms. These algorithms can detect, prevent, and trace the spread of COVID-19 and provide a better understanding of the disease in smart cities. Similarly, the study outlined case studies on the application of machine learning to help fight against COVID-19 in hospitals worldwide. The framework proposed in the study is a comprehensive presentation on the major components needed to integrate the machine learning approach with other AI-based solutions. Finally, the machine learning framework presented in this study has the potential to help national healthcare systems in curtailing the COVID-19 pandemic in smart cities. In addition, the proposed framework is poised as a pointer for generating research interests that would yield outcomes capable of been integrated to form an improved framework.

Download Full-text

A Machine Learning Approach for Micro-Credit Scoring

Risks ◽

10.3390/risks9030050 ◽

2021 ◽

Vol 9 (3) ◽

pp. 50

Author(s):

Apostolos Ampountolas ◽

Titus Nyarko Nde ◽

Paresh Date ◽

Corina Constantinescu

Keyword(s):

Machine Learning ◽

Random Forest ◽

Interest Rates ◽

Credit Scoring ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Learning Approach ◽

Machine Learning Approach ◽

Credit History ◽

Lending Institutions

In micro-lending markets, lack of recorded credit history is a significant impediment to assessing individual borrowers’ creditworthiness and therefore deciding fair interest rates. This research compares various machine learning algorithms on real micro-lending data to test their efficacy at classifying borrowers into various credit categories. We demonstrate that off-the-shelf multi-class classifiers such as random forest algorithms can perform this task very well, using readily available data about customers (such as age, occupation, and location). This presents inexpensive and reliable means to micro-lending institutions around the developing world with which to assess creditworthiness in the absence of credit history or central credit databases.

Download Full-text

A Machine Learning Approach to Career Path Choice for Information Technology Graduates

Engineering, Technology & Applied Science Research ◽

10.48084/etasr.3821 ◽

2020 ◽

Vol 10 (6) ◽

pp. 6589-6596

Author(s):

H. Al-Dossari ◽

F. A. Nughaymish ◽

Z. Al-Qahtani ◽

M. Alkahlifah ◽

A. Alqahtani

Keyword(s):

Machine Learning ◽

Information Technology ◽

Recommendation System ◽

Learning Algorithms ◽

Career Path ◽

Performance Comparison ◽

Machine Learning Algorithms ◽

Learning Approach ◽

It Professionals ◽

Machine Learning Approach

Enterprises rely more and more on well-qualified and highly specialized IT professionals. Although the increasing availability of IT jobs is a good indicator for IT graduates, they nonetheless may find themselves confused about the most appropriate career for their future. In this paper, a recommendation system called CareerRec is proposed, which uses machine learning algorithms to help IT graduates select a career path based on their skills. CareerRec was trained and tested using a dataset of 2255 employees in the IT sector in Saudi Arabia. We conducted a performance comparison between five machine learning algorithms to assess their accuracy for predicting the best-suited career path among 3 classes. Our experiments demonstrate that the XGBoost algorithm outperforms other models and gives the highest accuracy (70.47%).

Download Full-text

Assessment of Facial Homogeneity with Regard to Genealogical Aspects Based on Deep Learning Approach

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽

10.17762/turcomat.v12i3.962 ◽

2021 ◽

Vol 12 (3) ◽

pp. 1550-1556

Author(s):

Ravi Kumar Y B Et.al

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Binary Classification ◽

Learning Algorithms ◽

Research Work ◽

Machine Learning Algorithms ◽

Facial Features ◽

Learning Approach

The current research work encompasses the assessment of similarity based facial features of images with erected method so as to determines the genealogical similarity. It is based on the principle of grouping the closer features, as compared to those which are away from the predefined threshold for a better ascertainment of the extracted features. The system developed is trained using deep learning-oriented architecture incorporating these closer features for a binary classification of the subjects considered into genealogic non-genealogic. The genealogic set of data is further used to calculate the percentage of similarity with erected methods. The present work considered XX datasets from XXXX source for the assessment of facial similarities. The results portrayed an accuracy of 96.3% for genealogic data, the salient among them being those of father-daughter (98.1%), father-son(98.3%), mother-daughter(96.6%), mother-son(96.1%) genealogy in case of the datasets from “kinface W-I”. Extending this work onto “kinface W-II” set of data, the results were promising with father-daughter(98.5%), father-son(96.7%), mother-daughter(93.4%) and mother-son(98.9%) genealogy. Such an approach could be further extended to larger database so as to assess the genealogical similarity with the aid of machine-learning algorithms.

Download Full-text

Deep learning for predicting disease status using genomic data

10.7287/peerj.preprints.27123v1 ◽

2018 ◽

Author(s):

Qianfan Wu ◽

Adel Boueiz ◽

Alican Bozkurt ◽

Arya Masoomi ◽

Allan Wang ◽

...

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Rapid Development ◽

Learning Algorithms ◽

Genomic Data ◽

Disease Status ◽

Machine Learning Algorithms ◽

High Dimensional ◽

Learning Approach ◽

Low Dimensional

Predicting disease status for a complex human disease using genomic data is an important, yet challenging, step in personalized medicine. Among many challenges, the so-called curse of dimensionality problem results in unsatisfied performances of many state-of-art machine learning algorithms. A major recent advance in machine learning is the rapid development of deep learning algorithms that can efficiently extract meaningful features from high-dimensional and complex datasets through a stacked and hierarchical learning process. Deep learning has shown breakthrough performance in several areas including image recognition, natural language processing, and speech recognition. However, the performance of deep learning in predicting disease status using genomic datasets is still not well studied. In this article, we performed a review on the four relevant articles that we found through our thorough literature review. All four articles used auto-encoders to project high-dimensional genomic data to a low dimensional space and then applied the state-of-the-art machine learning algorithms to predict disease status based on the low-dimensional representations. This deep learning approach outperformed existing prediction approaches, such as prediction based on probe-wise screening and prediction based on principal component analysis. The limitations of the current deep learning approach and possible improvements were also discussed.

Download Full-text

Review on anomalous gait behavior detection using machine learning algorithms

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v9i5.2255 ◽

2020 ◽

Vol 9 (5) ◽

pp. 2090-2096

Author(s):

Hana’ Abd Razak ◽

M. Ahmed M. Saleh ◽

Nooritawati Md Tahir

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Deep Learning ◽

Transfer Learning ◽

Gait Recognition ◽

Learning Algorithms ◽

Anomalous Behavior ◽

Machine Learning Algorithms ◽

Behavior Detection ◽

Detection And Recognition

A review on anomalous behavior in crime by other researchers is discussed in this study that focused specifically on the linkage between anomalous behaviors. Next, comprehensive reviews related to gait recognition in utilizing machine learning algorithms for detection and recognition of anomalous behavior is elaborated too. The review begins with the conventional approach of gait recognition that includes feature extraction and classification using PCA, OLS, ANN, and SVM. Further, the review focused on utilization of deep learning namely CNN for anomalous gait behavior detection and transfer learning using pre-trained CNNs such as AlexNet, VGG, and a few more. To the extent of our knowledge, very few studies investigated and explored crime related anomalous behavior based on their gaits, hence this will be the next study that we will explore.

Download Full-text

Development of Classification Methods for Wheeze and Crackle Using Mel Frequency Cepstral Coefficient (MFCC): A Deep Learning Approach

International Journal on Perceptive and Cognitive Computing ◽

10.31436/ijpcc.v6i2.166 ◽

2020 ◽

Vol 6 (2) ◽

pp. 107-114

Author(s):

Tinir Mohamed Sadi ◽

Raini Hassan

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Learning Algorithms ◽

The State ◽

Machine Learning Algorithms ◽

Common Method ◽

Learning Approach ◽

Classification Methods ◽

Mel Frequency Cepstral Coefficient

The most common method used by physicians and pulmonologists to evaluate the state of the lung is by listening to the acoustics of the patient's breathing by a stethoscope. Misdiagnosis and eventually, mistreatment are rampant if auscultation is not done properly. There have been efforts to address this problem using a myriad of machine learning algorithms, but little has been done using deep learning. A CNN model with MFCC is expected to mitigate these problems. The problem has been in the paucity of large enough datasets. Results show 0.76 and 0.60 for recall for wheeze and crackle respectively, these number are set to increase with optimization.

Download Full-text

A Research on Deep Learning Advance for Landslide Classification using Convolutional Neural Networks

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.f1184.0486s419 ◽

2019 ◽

Vol 8 (6S4) ◽

pp. 903-906

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Feature Extraction ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Support Vector ◽

Data Set ◽

Proposed Model

Landslides can easily be tragic to human life and property. Increase in the rate of human settlement in the mountains has resulted in safety concerns. Landslides have caused economic loss between 1-2% of the GDP in many developing countries. In this study, we discuss a deep learning approach to detect landslides. Convolutional Neural Networks are used for feature extraction for our proposed model. As there was no source of an exact and precise data set for feature extraction, therefore, a new data set was built for testing the model. We have tested and compared this work with our proposed model and with other machine-learning algorithms such as Logistic Regression, Random Forest, AdaBoost, K-Nearest Neighbors and Support Vector Machine. Our proposed deep learning model produces a classification accuracy of 96.90% outperforming the classical machine-learning algorithms.

Download Full-text