Intelligent Personalized Abnormality Detection for Remote Health Monitoring

This chapter proposes an approach that combines natural language processing (NLP) techniques with supervised machine learning algorithms to detect external plagiarism. The emphasis is on constructing a framework that detects plagiarism in monolingual texts using an n-gram frequency comparison approach. The framework is based on 120 characteristics extracted during pre-processing with simple NLP techniques. Filter metrics are then applied to select the most relevant features, and a supervised classification algorithm classifies the documents into four levels of plagiarism. A confusion matrix is built to estimate the false positives and false negatives. Finally, the authors show that a C4.5 decision tree-based classifier is more suitable than naive Bayes in terms of accuracy. The framework achieved 89% accuracy with low false positive and false negative rates, and it shows higher precision and recall than the passage similarity, sentence similarity, and search space reduction methods.
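
The n-gram comparison mentioned in this abstract can be illustrated with a minimal sketch. It assumes word trigrams and a simple containment score. The function names and sample sentences are hypothetical; this is not the chapter's 120-feature framework.

```python
from collections import Counter


def word_ngrams(text, n=3):
    """Return a Counter of lowercase word n-grams in `text`."""
    words = text.lower().split()
    return Counter(tuple(words[i:i + n]) for i in range(len(words) - n + 1))


def ngram_containment(suspicious, source, n=3):
    """Fraction of the suspicious text's n-grams (with multiplicity) also found in the source."""
    susp = word_ngrams(suspicious, n)
    src = word_ngrams(source, n)
    total = sum(susp.values())
    if total == 0:
        return 0.0  # text shorter than n words: nothing to compare
    shared = sum(min(count, src[gram]) for gram, count in susp.items())
    return shared / total


# Illustrative sentences, not taken from any corpus.
source = "the quick brown fox jumps over the lazy dog"
reworded = "a fast brown fox leaps over a sleepy dog"

print(ngram_containment(source, source))    # 1.0 (verbatim copy)
print(ngram_containment(reworded, source))  # 0.0 (no shared trigram)
```

A score like this could serve as one input feature to a classifier; on its own it only measures surface overlap.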

Combine Clustering and Machine Learning for Enhancing the Efficiency of Energy Baseline of Chiller System

Energies ◽ 10.3390/en13174368 ◽ 2020 ◽ Vol 13 (17) ◽ pp. 4368 ◽ Cited By ~ 1

Author(s): Chun-Wei Chen ◽ Chun-Chang Li ◽ Chen-Yu Lin

Keyword(s): Machine Learning ◽ Prediction Accuracy ◽ Prediction Models ◽ Machine Learning Algorithms ◽ Learning Models ◽ Important Method ◽ Gap Statistic ◽ Machine Learning Model ◽ Key Variables

An energy baseline is an important method for measuring the energy-saving benefits of a chiller system; the benefits are calculated by comparing the output of a prediction model with actual results. Machine learning is now often adopted as the prediction model for energy baselines. Common models include regression, ensemble learning, and deep learning models. In this study, we first reviewed several machine learning algorithms used to establish prediction models. We then adopted clustering to preprocess the chiller data. Data mining, K-means clustering, and the gap statistic were used to identify the critical variables for clustering chiller modes. Applying these key variables enhanced the quality of the chiller data, and combining the clustering results with the machine learning model improved both the prediction accuracy of the model and the reliability of the energy baselines.
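
The clustering step can be illustrated with a minimal sketch of plain K-means (Lloyd's algorithm) with a fixed number of clusters. It assumes two hypothetical variables per operating point (load percentage and outdoor temperature) and made-up readings. The gap statistic the authors use to choose the number of clusters is not shown.

```python
def kmeans(points, centroids, iterations=100):
    """Lloyd's algorithm on tuples of floats; returns (centroids, labels)."""
    centroids = [tuple(c) for c in centroids]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(range(len(centroids)),
                key=lambda k: sum((a - b) ** 2 for a, b in zip(p, centroids[k])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                new_centroids.append(tuple(sum(dim) / len(members) for dim in zip(*members)))
            else:
                new_centroids.append(centroids[k])  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break  # converged: assignments can no longer change
        centroids = new_centroids
    return centroids, labels


# Hypothetical (load %, outdoor temperature in deg C) readings, not real chiller data.
points = [(30, 20), (32, 21), (31, 19), (80, 33), (82, 35), (78, 34)]
final_centroids, labels = kmeans(points, [points[0], points[3]])
print(labels)           # [0, 0, 0, 1, 1, 1]
print(final_centroids)  # [(31.0, 20.0), (80.0, 34.0)]
```

Each resulting cluster could then get its own prediction model, which is one way to read "combining the clustering results and the machine learning model".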

Machine Learning Assisted Cervical Cancer Detection

Frontiers in Public Health ◽ 10.3389/fpubh.2021.788376 ◽ 2021 ◽ Vol 9

Author(s): Mavra Mehmood ◽ Muhammad Rizwan ◽ Michal Gregus ml ◽ Sidra Abbas

Keyword(s): Machine Learning ◽ Cervical Cancer ◽ Mean Squared Error ◽ Medical Center ◽ Pearson Correlation ◽ False Negative ◽ Hybrid Approach ◽ False Negative Rate ◽ Machine Learning Algorithms ◽ Screening Programs

Cervical cancer is the fourth most common cause of cancer death in women worldwide. It is associated with human papillomavirus (HPV) infection. Early screening has made cervical cancer a preventable disease, which reduces its global burden. In developing countries, women lack access to adequate screening programs because of the cost of regular examination, limited awareness, and lack of access to medical centers. As a result, the individual patient's expected risk becomes very high. Many risk factors are relevant to malignant cervical formation. This paper proposes an approach named CervDetect that uses machine learning algorithms to evaluate the risk elements of malignant cervical formation. CervDetect pre-processes the data using the Pearson correlation between input variables as well as with the output variable. It then uses the random forest (RF) feature selection technique to select significant features. Finally, CervDetect uses a hybrid approach that combines RF and shallow neural networks to detect cervical cancer. Results show that CervDetect accurately predicts cervical cancer, outperforms state-of-the-art studies, and achieves an accuracy of 93.6%, a mean squared error (MSE) of 0.07111, a false-positive rate (FPR) of 6.4%, and a false-negative rate (FNR) of 100%.
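
The Pearson correlation used in the pre-processing step can be computed as in the following minimal sketch. The feature names and values are hypothetical, and the abstract does not say how CervDetect thresholds or uses the coefficients.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear information
    return cov / sqrt(var_x * var_y)


# Hypothetical feature columns and a binary outcome, for illustration only.
features = {
    "feature_a": [1, 2, 3, 4],
    "feature_b": [5, 5, 5, 5],
}
outcome = [0, 0, 1, 1]

for name, column in features.items():
    print(name, round(pearson(column, outcome), 3))
```

Features whose absolute correlation with the outcome is near zero, or pairs of inputs that are almost perfectly correlated with each other, are typical candidates for removal before model training.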

Diagnosis and Classification of the Diabetes Using Machine Learning Algorithms

10.21203/rs.3.rs-514771/v2 ◽ 2021

Author(s): Prasannavenkatesan Theerthagiri ◽ Usha Ruby A ◽ Vidya J

Keyword(s): Machine Learning ◽ Multilayer Perceptron ◽ Nearest Neighbor ◽ False Positive Rate ◽ Learning Algorithms ◽ False Negative ◽ False Negative Rate ◽ Disease Diagnosis ◽ Machine Learning Algorithms ◽ K Nearest Neighbor

Diabetes mellitus is a chronic disease that may cause many complications. Machine learning algorithms are used to diagnose and predict diabetes. Learning-based algorithms play a vital role in supporting decision making in disease diagnosis and prediction. In this paper, traditional classification algorithms and neural network-based machine learning are investigated on a diabetes dataset. Various performance measures are evaluated for the K-nearest neighbor, naive Bayes, extra trees, decision tree, radial basis function, and multilayer perceptron algorithms. This supports estimating which patients may suffer from diabetes in the future. The results show that the multilayer perceptron (MLP) algorithm gives the highest prediction accuracy with the lowest MSE of 0.19. The MLP also gives the lowest false positive rate and false negative rate, with the highest area under the curve of 86%.
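
The false positive rate, false negative rate, and MSE reported in this abstract follow standard definitions. A minimal sketch of those definitions, using made-up labels rather than the paper's data:

```python
def binary_rates(y_true, y_pred):
    """Accuracy, false positive rate and false negative rate for 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "fpr": fp / (fp + tn) if (fp + tn) else 0.0,  # share of negatives flagged positive
        "fnr": fn / (fn + tp) if (fn + tp) else 0.0,  # share of positives that were missed
    }


def mean_squared_error(y_true, y_score):
    """Mean of squared differences between targets and predicted values."""
    return sum((t - s) ** 2 for t, s in zip(y_true, y_score)) / len(y_true)


# Illustrative labels: 4 positive and 6 negative cases.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]
rates = binary_rates(y_true, y_pred)
print(rates["accuracy"], rates["fnr"])  # 0.8 0.25
```

In a diagnostic setting the false negative rate is usually the more costly error, since it counts patients with the disease whom the model missed.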

Machine learning improves the prediction of febrile neutropenia in Korean inpatients undergoing chemotherapy for breast cancer

Scientific Reports ◽ 10.1038/s41598-020-71927-6 ◽ 2020 ◽ Vol 10 (1) ◽ Cited By ~ 1

Author(s): Bum-Joo Cho ◽ Kyoung Min Kim ◽ Sanchir-Erdene Bilegsaikhan ◽ Yong Joon Suh

Keyword(s): Breast Cancer ◽ Machine Learning ◽ Risk Factors ◽ Febrile Neutropenia ◽ Prediction Models ◽ Learning Algorithms ◽ Area Under The Curve ◽ Primary Prophylaxis ◽ Machine Learning Algorithms ◽ Significant Difference

Febrile neutropenia (FN) is one of the most concerning complications of chemotherapy, and its prediction remains difficult. This study aimed to identify the risk factors for FN and to build prediction models of FN using machine learning algorithms. Medical records of hospitalized patients who underwent chemotherapy after surgery for breast cancer between May 2002 and September 2018 were selectively reviewed for model development. Demographic, clinical, pathological, and therapeutic data were analyzed to identify risk factors for FN. Using machine learning algorithms, prediction models were developed and evaluated for performance. Of 933 selected inpatients with a mean age of 51.8 ± 10.7 years, FN developed in 409 (43.8%) patients. FN incidence differed significantly according to age, staging, taxane-based regimen, and blood count 5 days after chemotherapy. A logistic regression model built on these findings achieved an area under the curve (AUC) of 0.870; machine learning improved the AUC to 0.908. Machine learning improves the prediction of FN in patients undergoing chemotherapy for breast cancer compared to the conventional statistical model. In these high-risk patients, primary prophylaxis with granulocyte colony-stimulating factor could be considered.
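
The AUC values compared in this abstract can be read as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. A minimal sketch of that pairwise definition, with made-up scores (the function name is hypothetical, and the quadratic loop is only practical for small samples):

```python
def roc_auc(labels, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count half."""
    positives = [s for l, s in zip(labels, scores) if l == 1]
    negatives = [s for l, s in zip(labels, scores) if l == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Illustrative risk scores for two FN cases (label 1) and two non-FN cases (label 0).
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1]
print(roc_auc(labels, scores))  # 0.75
```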

Predicting Intraday Prices in the Frontier Stock Market of Romania Using Machine Learning Algorithms

International Journal of Economics and Financial Research ◽ 10.32861/ijefr.67.170.179 ◽ 2020 ◽ pp. 170-179

Author(s): Dan Gabriel ANGHEL

Keyword(s): Machine Learning ◽ Stock Market ◽ Stock Prices ◽ Prediction Accuracy ◽ Prediction Models ◽ State Of The Art ◽ Predictive Ability ◽ Weak Form ◽ Machine Learning Algorithms ◽ Forecasting Models

This paper investigates whether forecasting models based on machine learning (ML) algorithms can predict intraday prices in the small, frontier stock market of Romania. The results show that this is indeed the case. Moreover, the prediction accuracy of the various models improves as the forecasting horizon increases. Overall, ML forecasting models are superior to the passive buy-and-hold strategy, as well as to a naïve strategy that always predicts that the last known price action will continue. However, we also show that this superior predictive ability cannot be converted into “abnormal”, economically significant profits after considering transaction costs. This implies that intraday stock prices incorporate information within the accepted bounds of weak-form market efficiency, and cannot be “timed” even by sophisticated investors equipped with state-of-the-art ML prediction models.
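
Two ideas in this abstract lend themselves to a small worked example: the naïve "last move repeats" benchmark, and the way transaction costs can erase a positive gross return. The sketch below assumes a fixed proportional cost per trade and uses made-up prices and returns; the function names are hypothetical and the paper's actual cost model is not stated in the abstract.

```python
def sign(x):
    """Return +1, -1, or 0 according to the sign of x."""
    return (x > 0) - (x < 0)


def naive_hit_rate(prices):
    """Share of price moves correctly predicted by 'the last move repeats'."""
    moves = [sign(b - a) for a, b in zip(prices, prices[1:])]
    predictions, actual = moves[:-1], moves[1:]
    if not predictions:
        return 0.0
    hits = sum(1 for p, a in zip(predictions, actual) if p == a)
    return hits / len(predictions)


def net_return(gross_returns, cost_per_trade):
    """Total return after subtracting a fixed proportional cost from every trade."""
    return sum(r - cost_per_trade for r in gross_returns)


prices = [100, 101, 102, 101, 102, 103]  # illustrative prices, not market data
print(naive_hit_rate(prices))            # 0.5

gross = [0.002, -0.001, 0.003]           # illustrative per-trade returns
print(sum(gross) > 0)                    # True: profitable before costs
print(net_return(gross, 0.0015) < 0)     # True: unprofitable after costs
```

A forecasting model is only interesting in this setting if it beats the naïve hit rate and still shows a positive net return once costs are charged on every trade.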

Machine Learning Prediction Models for Mitral Valve Repairability and Mitral Regurgitation Recurrence in Patients Undergoing Surgical Mitral Valve Repair

Bioengineering ◽ 10.3390/bioengineering8090117 ◽ 2021 ◽ Vol 8 (9) ◽ pp. 117

Author(s): Marco Penso ◽ Mauro Pepi ◽ Valentina Mantegazza ◽ Claudia Cefalù ◽ Manuela Muratori ◽ ...

Keyword(s): Machine Learning ◽ Mitral Valve ◽ Prediction Models ◽ Area Under The Curve ◽ Mitral Valve Regurgitation ◽ Machine Learning Algorithms ◽ Failure Assessment ◽ Echocardiographic Examination ◽ Repair Procedure ◽ Surgical Failure

Background: Mitral valve regurgitation (MR) is the most common valvular heart disease, and the variables associated with MR recurrence remain controversial. We aim to develop a machine learning-based prognostic model to predict causes of mitral valve (MV) repair failure and MR recurrence. Methods: 1000 patients who underwent MV repair at our institution between 2008 and 2018 were enrolled. Patients were followed longitudinally for up to three years. Clinical and echocardiographic data were included in the analysis. Endpoints were MV repair surgical failure with consequent MV replacement or moderate/severe MR (>2+) recurrence at one month, and moderate/severe MR recurrence after three years. Results: 817 patients (DS1) had an echocardiographic examination at one month, while 295 (DS2) also had one at three years. Data were randomly divided into training (DS1: n = 654; DS2: n = 206) and validation (DS1: n = 164; DS2: n = 89) cohorts. For intra-operative or early MV repair failure assessment, the best area under the curve (AUC) was 0.75, and the complexity of mitral valve prolapse was the main predictor. In predicting moderate/severe recurrent MR at three years, the best AUC was 0.92, and residual MR at six months was the most important predictor. Conclusions: Machine learning algorithms may improve prognosis after the MV repair procedure, thus improving indications for correct candidate selection for MV surgical repair.

Reducing U2R and R2L category false negative rates with support vector machines

Serbian Journal of Electrical Engineering ◽ 10.2298/sjee131007015m ◽ 2014 ◽ Vol 11 (1) ◽ pp. 175-188 ◽ Cited By ~ 1

Author(s): Nemanja Macek ◽ Milan Milosavljevic

Keyword(s): Machine Learning ◽ Detection System ◽ False Negative ◽ False Negative Rate ◽ Machine Learning Algorithms ◽ Support Vector ◽ Negative Rate ◽ Machine Learning Model ◽ Vector Machines ◽ Feature Values

KDD Cup '99 is a commonly used dataset for training and testing machine learning algorithms for intrusion detection systems (IDS). Some of the major downsides of the dataset are the distribution and proportions of U2R and R2L instances, which represent the most dangerous attack types, as well as the existence of R2L attack instances identical to normal traffic. This makes minority categories harder to detect and causes problems when building a machine learning model capable of detecting these attacks with a sufficiently low false negative rate. This paper presents a new support vector machine-based intrusion detection system that classifies unknown data instances according to both the feature values and weight factors that represent the importance of features to the classification. An increased detection rate and a significantly decreased false negative rate for the U2R and R2L categories, which have very few instances in the training set, have been demonstrated empirically.

Diagnosis and Classification of the Diabetes Using Machine Learning Algorithms

10.21203/rs.3.rs-514771/v1 ◽ 2021

Author(s): Prasannavenkatesan Theerthagiri ◽ Usha Ruby A ◽ Vidya J

Keyword(s): Machine Learning ◽ Multilayer Perceptron ◽ Nearest Neighbor ◽ False Positive Rate ◽ Learning Algorithms ◽ False Negative ◽ False Negative Rate ◽ Disease Diagnosis ◽ Machine Learning Algorithms ◽ K Nearest Neighbor

Diabetes mellitus is a chronic disease that may cause many complications. Machine learning algorithms are used to diagnose and predict diabetes. Learning-based algorithms play a vital role in supporting decision making in disease diagnosis and prediction. In this paper, traditional classification algorithms and neural network-based machine learning are investigated on a diabetes dataset. Various performance measures are evaluated for the K-nearest neighbor, naive Bayes, extra trees, decision tree, radial basis function, and multilayer perceptron algorithms. This supports estimating which patients may suffer from diabetes in the future. The results show that the multilayer perceptron (MLP) algorithm gives the highest prediction accuracy with the lowest MSE of 0.19. The MLP also gives the lowest false positive rate and false negative rate, with the highest area under the curve of 86%.

Diagnosis and Classification of the Diabetes Using Machine Learning Algorithms

10.21203/rs.3.rs-514771/v3 ◽ 2021

Author(s): Prasannavenkatesan Theerthagiri ◽ Usha Ruby A ◽ Vidya J

Keyword(s): Machine Learning ◽ Multilayer Perceptron ◽ Nearest Neighbor ◽ False Positive Rate ◽ Learning Algorithms ◽ False Negative ◽ False Negative Rate ◽ Disease Diagnosis ◽ Machine Learning Algorithms ◽ K Nearest Neighbor

Diabetes mellitus is a chronic disease that may cause many complications. Machine learning algorithms are used to diagnose and predict diabetes. Learning-based algorithms play a vital role in supporting decision making in disease diagnosis and prediction. In this paper, traditional classification algorithms and neural network-based machine learning are investigated on a diabetes dataset. Various performance measures are evaluated for the K-nearest neighbor, naive Bayes, extra trees, decision tree, radial basis function, and multilayer perceptron algorithms. This supports estimating which patients may suffer from diabetes in the future. The results show that the multilayer perceptron (MLP) algorithm gives the highest prediction accuracy with the lowest MSE of 0.19. The MLP also gives the lowest false positive rate and false negative rate, with the highest area under the curve of 86%.
