Constructing a System for Analysis of Machine Learning Techniques for Early Detection of Thyroid

Thyroid disease is a chronic and complex condition caused by abnormal levels of TSH (thyroid-stimulating hormone) or by problems in the thyroid gland itself. Hashimoto's thyroiditis is the most widely recognized cause of hypothyroidism: in this autoimmune condition, the body produces antibodies that destroy the thyroid gland. The proposed system applies machine learning algorithms to predict thyroid disease effectively in societies with a high disease burden. The disease is a serious public health concern, and its incidence is rising sharply in many countries. The disease must therefore be predicted as early as possible, in a way that overcomes the low precision of existing clinical decision-making tools. This paper examines numerous machine learning strategies for osteoporosis prediction. It assesses feature selection combined with classification techniques. WEKA's classification techniques are applied to an osteoporosis data set. The results are computed under several test options, including 10-fold cross-validation, the training set, and a percentage split, each with and without feature selection. The experiments with and without feature selection are compared in terms of correctly classified instances, runtime, kappa, and mean absolute error.
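
Two of the measures named in this abstract, the share of correctly classified instances and the kappa statistic, can be computed directly from actual and predicted labels. The following is a minimal sketch in plain Python, not the WEKA implementation the paper uses; the labels and the ten example patients are invented for illustration.

```python
from collections import Counter

def accuracy(actual, predicted):
    """Fraction of instances whose predicted label matches the actual label."""
    correct = sum(a == p for a, p in zip(actual, predicted))
    return correct / len(actual)

def cohen_kappa(actual, predicted):
    """Agreement between predictions and labels, corrected for chance agreement."""
    n = len(actual)
    observed = accuracy(actual, predicted)
    actual_counts = Counter(actual)
    predicted_counts = Counter(predicted)
    # Agreement expected if predictions were independent of the actual labels.
    expected = sum(actual_counts[c] * predicted_counts[c] for c in actual_counts) / (n * n)
    if expected == 1.0:
        # Degenerate case (a single class everywhere); returning 0.0 is a
        # convention chosen for this sketch, since kappa is undefined here.
        return 0.0
    return (observed - expected) / (1.0 - expected)

# Hypothetical labels for ten patients (not data from the paper).
actual = ["sick", "sick", "sick", "sick", "well", "well", "well", "well", "well", "well"]
predicted = ["sick", "sick", "sick", "well", "well", "well", "well", "well", "well", "sick"]

print(accuracy(actual, predicted))
print(cohen_kappa(actual, predicted))
```

Kappa is lower than raw accuracy here because part of the agreement would be expected by chance given the class proportions.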

Human Behavior Prediction and Analysis Using Machine Learning-A Review

Turkish Journal of Computer and Mathematics Education (TURCOMAT), DOI 10.17762/turcomat.v12i5.1499, 2021, Vol 12 (5), pp. 870-876

Author(s): Monali Gulhane, T. Sajana

Keyword(s): Machine Learning, Feature Extraction, Social Behavior, Human Behavior, Technical Difficulty, Vital Role, Machine Learning Algorithms, Disease Classification, Machine Learning Techniques, Classification Techniques

Many current efforts in medicine aim to predict human behaviour, and the analysis of patient behaviour is under active study. The proposed research methodology addresses the technical difficulty of finding a cost-efficient method to predict user behaviour. Good mental health supports a strong immune system, which helps people stay healthy during the COVID-19 pandemic. After a detailed study of different human disease classification techniques, we find that machine learning techniques are reliable for feature extraction and for the analysis of different human parameters. CNN is the most suitable choice for disease classification. Feature extraction and feature selection are managed automatically by the CNN layers, which reduces the training speed. This research will further explore sensor-based feature extraction (EEG, ECG, etc.) with machine learning algorithms for the early detection of diseases from human behavior on different platforms. Social behavior and eating habits play a vital role in disease detection. A system that combines such a wide variety of features with effective classification techniques at each stage is needed. This paper contributes a review of human behavior analysis through different body parameters, food habits, and social media influences, together with the social behavior of the person. The main objective of the research is to analyze these parameters from different areas to predict the early signs of disease.

Diagnosis of rotating machine unbalance using machine learning algorithms on vibration orbital features

Journal of Vibration and Control, DOI 10.1177/1077546320929830, 2020, pp. 107754632092983

Author(s): Leonardo S Jablon, Sergio L Avila, Bruno Borba, Gustavo L Mourão, Fabrizio L Freitas, ...

Keyword(s): Machine Learning, Learning Strategies, Machine Learning Algorithms, Vibration Measurement, Machine Learning Techniques, Rolling Element Bearings, Rotating Machines, Signal To Noise, Rotating Machine, Rolling Element

The diagnosis of failures in rotating machines has been the subject of many studies because of its benefits for maintenance. Condition monitoring reduces maintenance costs, increases reliability and availability, and extends the useful life of critical rotating machinery in industrial environments. Machine learning techniques have been evolving rapidly, and their applications are bringing better performance to many fields. This study presents a new strategy to improve the diagnosis performance of rotating machines using machine learning strategies on vibration orbital features. The advantage of using orbits in comparison to other vibration measurement systems is the simplicity of the instrumentation involved as well as the multiplicity of information contained in the orbit. On the other hand, rolling element bearings are prevalent in industrial machinery. This type of bearing has less orbital oscillation and is noisier than sliding contact bearings, so it is more difficult to extract useful information. Practical results on an industrial motor workbench with rolling element bearings are presented, and the algorithm's robustness is evaluated by calculating diagnosis accuracy using inputs with different signal-to-noise ratios. For this kind of noisy scenario, where signal analysis is naturally tough, the algorithm classifies correctly approximately 85% of the time. In a completely harsh environment, where the signal-to-noise ratio can be smaller than −25 dB, the accuracy achieved is close to 60%. These statistics show that the proposed strategy can be robust for diagnosing rotating machine unbalance even in the worst scenarios, which is required for industrial applications.
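
The robustness test described above relies on inputs with a known signal-to-noise ratio. As an illustration only, the sketch below shows one common way to build such an input: scale Gaussian noise so that 10·log10(P_signal / P_noise) equals a target value in dB. The sine test signal, the function names, and the seed are assumptions of this sketch; the abstract does not say how the authors generated their noisy inputs.

```python
import math
import random

def power(samples):
    """Mean squared amplitude of a sampled signal."""
    return sum(x * x for x in samples) / len(samples)

def add_noise_at_snr(signal, target_snr_db, seed=0):
    """Return (noisy_signal, noise) with noise scaled to the requested SNR in dB."""
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in signal]
    # SNR_dB = 10*log10(P_signal / P_noise)  =>  P_noise = P_signal / 10**(SNR_dB/10)
    target_noise_power = power(signal) / (10 ** (target_snr_db / 10))
    scale = math.sqrt(target_noise_power / power(noise))
    scaled = [scale * n for n in noise]
    return [s + n for s, n in zip(signal, scaled)], scaled

def measure_snr_db(signal, noise):
    """Signal-to-noise ratio in decibels."""
    return 10 * math.log10(power(signal) / power(noise))

# Hypothetical vibration-like test signal: five cycles of a sine wave.
signal = [math.sin(2 * math.pi * 5 * t / 1000) for t in range(1000)]
noisy, noise = add_noise_at_snr(signal, target_snr_db=-25.0)
print(round(measure_snr_db(signal, noise), 6))
```

At −25 dB the noise power is roughly 316 times the signal power, which gives a sense of how harsh the reported worst-case scenario is.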

Predictive Analysis of Diabetes Mellitus Using Machine Learning Techniques

Journal of Computational and Theoretical Nanoscience, DOI 10.1166/jctn.2020.9207, 2020, Vol 17 (8), pp. 3449-3452

Author(s): M. S. Roobini, Y. Sai Satwick, A. Anil Kumar Reddy, M. Lakshmi, D. Deepa, ...

Keyword(s): Diabetes Mellitus, Machine Learning, Early Stage, The Body, Machine Learning Techniques, Support Vector, Pima Indians, Classification Techniques, Learning Techniques, Prediction Of Diabetes

In today's world, diabetes is one of the major health challenges in India. It is a group of syndromes that result in too much sugar in the blood. It is a chronic condition that affects the way the body processes blood sugar. Prevention and prediction of diabetes mellitus are gaining increasing interest in the medical sciences. The aim is to predict diabetes at an early stage using different machine learning techniques. In this paper, we use well-known classification techniques: Decision Tree, K-Nearest Neighbors, Support Vector Machine, and Random Forest. These techniques are applied to the Pima Indians diabetes dataset. We predict diabetes at different stages and analyze the performance of the different classification techniques. We also propose a conceptual model for the prediction of diabetes mellitus using different machine learning techniques, and we compare the accuracy of these techniques in detecting diabetes mellitus at an early stage.
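
Of the classifiers listed, K-Nearest Neighbors is the simplest to show in a few lines: a new record receives the majority label of its k closest training records. The sketch below is illustrative only. The two features (glucose, BMI), their values, and the labels are invented and are not taken from the Pima Indians dataset, and the function name is an assumption of this example.

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, query, k=3):
    """Predict the majority label among the k training points nearest to query."""
    # Pair each training point's Euclidean distance to the query with its label.
    distances = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]

# Hypothetical (glucose, BMI) records; 1 = diabetic, 0 = non-diabetic.
train_X = [(85, 22.0), (90, 24.5), (95, 23.0), (160, 33.0), (170, 35.5), (155, 31.0)]
train_y = [0, 0, 0, 1, 1, 1]

print(knn_predict(train_X, train_y, (165, 34.0)))
print(knn_predict(train_X, train_y, (88, 23.0)))
```

In practice the features would usually be scaled first, since unscaled Euclidean distance lets the feature with the largest numeric range dominate.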

Predicting Loan Approval of Bank Direct Marketing Data Using Ensemble Machine Learning Algorithms

International Journal of Circuits, Systems and Signal Processing, DOI 10.46300/9106.2020.14.117, 2020, Vol 14

Keyword(s): Machine Learning, Prediction Model, Prediction Models, Machine Learning Algorithms, Decision Makers, Machine Learning Techniques, Data Set, Ensemble Machine Learning, Marketing Data, Loan Approval

The Bank Marketing data set on Kaggle is mostly used to predict whether bank clients will subscribe to a long-term deposit. We believe that this data set could provide more useful information, such as predicting whether a bank client could be approved for a loan. This is a critical choice that has to be made by decision makers at the bank. Building a prediction model for such a high-stakes decision requires not only high prediction accuracy but also a reasonable interpretation of the predictions. In this research, different ensemble machine learning techniques, such as Bagging and Boosting, have been deployed. Our results show that the loan approval prediction model has an accuracy of 83.97%, which is approximately 25% better than most other state-of-the-art loan prediction models found in the literature. In addition, the model interpretation efforts in this research were able to explain a few critical cases that bank decision makers may encounter; the high accuracy of the designed models was therefore accompanied by trust in the predictions. We believe that the achieved model accuracy, together with the provided interpretation information, is vitally needed for decision makers to understand how to maintain a balance between the security and reliability of their financial lending system while providing fair credit opportunities to their clients.
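
Bagging, one of the two ensemble families named above, trains each base model on a bootstrap sample of the training data and combines the models by majority vote. The sketch below illustrates the idea with one-feature decision stumps in plain Python. It is not the authors' model: the single "income" feature, its values, and all function names are invented for this example, and boosting is not shown.

```python
import random
from collections import Counter

def fit_stump(X, y):
    """Pick the threshold on one numeric feature with the fewest training errors.

    The stump predicts 1 when x >= threshold and 0 otherwise.
    """
    best_threshold, best_errors = None, None
    for threshold in sorted(set(X)):
        errors = sum((x >= threshold) != bool(label) for x, label in zip(X, y))
        if best_errors is None or errors < best_errors:
            best_threshold, best_errors = threshold, errors
    return best_threshold

def fit_bagging(X, y, n_models=25, seed=0):
    """Train n_models stumps, each on a bootstrap sample drawn with replacement."""
    rng = random.Random(seed)
    n = len(X)
    thresholds = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]
        thresholds.append(fit_stump([X[i] for i in idx], [y[i] for i in idx]))
    return thresholds

def predict_bagging(thresholds, x):
    """Majority vote over the stumps' individual predictions."""
    votes = Counter(int(x >= t) for t in thresholds)
    return votes.most_common(1)[0][0]

# Hypothetical incomes (in thousands) and loan decisions; 1 = approved.
incomes = [20, 25, 30, 35, 40, 60, 65, 70, 75, 80]
approved = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]

model = fit_bagging(incomes, approved)
print(predict_bagging(model, 90), predict_bagging(model, 10))
```

Using an odd number of models avoids tied votes in a two-class problem.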

Vehicle Price Prediction using SVM Techniques

International Journal of Innovative Technology and Exploring Engineering - Special Issue, DOI 10.35940/ijitee.g5915.069820, 2020, Vol 9 (8), pp. 398-401

Keyword(s): Machine Learning, Research Area, Machine Learning Algorithms, Machine Learning Techniques, Support Vector, Data Set, Network Support, Java Application, Learning Techniques, The Individual

Vehicle price prediction has become a popular research area, and it requires considerable effort and expert knowledge of the field. A number of different attributes are considered in order to make the prediction more reliable and accurate. To estimate the price of used vehicles, a well-defined model was developed using three machine learning techniques: Artificial Neural Network, Support Vector Machine, and Random Forest. These techniques were applied not to individual items but to the whole group of data items. The data were taken from a web portal and used for the prediction; they were collected with a web scraper written in PHP. Machine learning algorithms of varying performance were compared to obtain the best result on the given data set. The final prediction model was integrated into a Java application.

Diagnosis of COVID-19 Using CT image Radiomics Features: A Comprehensive Machine Learning Study Involving 26,307 Patients

DOI 10.1101/2021.12.07.21267367, 2021

Author(s): Isaac Shiri, Yazdan Salimi, Abdollah Saberi, Masoumeh Pakbin, Ghasem Hajianfar, ...

Keyword(s): Machine Learning, Feature Selection, Learning Strategies, Lung Diseases, Pearson Correlation, Characteristic Curve, Univariate Analysis, Machine Learning Algorithms, Recursive Feature Elimination, Classifier Combination

Purpose: To derive and validate an effective radiomics-based model for differentiation of COVID-19 pneumonia from other lung diseases using a very large cohort of patients. Methods: We collected 19 private and 5 public datasets, accumulating to 26,307 individual patient images (15,148 COVID-19; 9,657 with other lung diseases, e.g. non-COVID-19 pneumonia, lung cancer, pulmonary embolism; 1,502 normal cases). Images were automatically segmented using a validated deep learning (DL) model and the results carefully reviewed. Images were first cropped into lung-only region boxes, then resized to 296×216 voxels. Voxel dimensions were resized to 1×1×1 mm³, followed by 64-bin discretization. The 108 extracted features included shape, first-order histogram, and texture features. Univariate analysis was first performed using simple logistic regression. The thresholds were fixed in the training set and evaluation was then performed on the test set. False discovery rate (FDR) correction was applied to the p-values. Z-score normalization was applied to all features. For multivariate analysis, features with high correlation (R² > 0.99) were eliminated first using Pearson correlation. We tested 96 different machine learning strategies by cross-combining 4 feature selectors or 8 dimensionality reduction techniques with 8 classifiers. We trained and evaluated our models using 3 different datasets: 1) the entire dataset (26,307 patients: 15,148 COVID-19; 11,159 non-COVID-19); 2) excluding normal patients in non-COVID-19, and including only RT-PCR positive COVID-19 cases in the COVID-19 class (20,697 patients: 12,419 COVID-19; 8,278 non-COVID-19); 3) including only non-COVID-19 pneumonia patients and a random sample of COVID-19 patients (5,582 patients: 3,000 COVID-19; 2,582 non-COVID-19) to provide balanced classes. Subsequently, each of these 3 datasets was randomly split into 70% and 30% for training and testing, respectively. All steps, including feature preprocessing, feature selection, and classification, were performed separately in each dataset. Classification algorithms were optimized during training using grid search algorithms. The best models were chosen by a one-standard-deviation rule in 10-fold cross-validation and were then evaluated on the test sets. Results: In dataset #1, the Relief feature selection and Random Forest (RF) classifier combination resulted in the highest performance (area under the receiver operating characteristic curve (AUC) = 0.99, sensitivity = 0.98, specificity = 0.94, accuracy = 0.96, positive predictive value (PPV) = 0.96, and negative predictive value (NPV) = 0.96). In dataset #2, the Recursive Feature Elimination (RFE) feature selection and RF classifier combination resulted in the highest performance (AUC = 0.99, sensitivity = 0.98, specificity = 0.95, accuracy = 0.97, PPV = 0.96, and NPV = 0.98). In dataset #3, the ANOVA feature selection and RF classifier combination resulted in the highest performance (AUC = 0.98, sensitivity = 0.96, specificity = 0.93, accuracy = 0.94, PPV = 0.93, NPV = 0.96). Conclusion: Radiomic features extracted from the entire lung combined with machine learning algorithms can enable very effective, routine diagnosis of COVID-19 pneumonia from CT images without the use of any other diagnostic test.
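
The redundancy filter in the Methods (dropping features whose pairwise Pearson correlation gives R² above 0.99) can be sketched as follows. This is an illustration under assumptions, not the authors' pipeline: the feature names and values are invented, the abstract does not say which feature of a correlated pair is kept (this sketch keeps the one seen first), and constant features, which would cause a division by zero, are assumed to be absent.

```python
import math

def pearson_r(a, b):
    """Pearson correlation coefficient between two equal-length value lists."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

def drop_correlated(features, r2_threshold=0.99):
    """Return the names of features kept after removing near-duplicates.

    features maps a feature name to its list of values across patients.
    A feature is dropped when its squared correlation with an already
    kept feature exceeds r2_threshold.
    """
    kept = []
    for name in features:
        redundant = any(
            pearson_r(features[name], features[other]) ** 2 > r2_threshold
            for other in kept
        )
        if not redundant:
            kept.append(name)
    return kept

# Hypothetical radiomic features for five patients.
features = {
    "volume": [1.0, 2.0, 3.0, 4.0, 5.0],
    "volume_mm3": [10.0, 20.0, 30.0, 40.0, 50.1],
    "entropy": [2.0, 1.0, 4.0, 3.0, 0.5],
}
print(drop_correlated(features))
```

Here "volume_mm3" is almost a rescaled copy of "volume", so it is removed, while "entropy" is only weakly correlated with "volume" and is kept.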

Future Prediction of Diabetics using XG Booster Classifiers

International Journal of Engineering and Advanced Technology - Regular Issue, DOI 10.35940/ijeat.c5144.029320, 2020, Vol 9 (3), pp. 2128-2132

Keyword(s): Machine Learning, Decision Tree, Naive Bayes, Naïve Bayes, The Body, Machine Learning Algorithms, Support Vector, Common Disease, Data Set, Glucose Content

Diabetes is one of the most common diseases today. This work proposes predicting the disease with machine learning techniques. With this approach, the risk factors of the disease can be identified and kept from worsening. Early prediction can help control the disease and save lives. For early prediction, we collect a data set with 8 attributes from 200 diabetic patients. The patient's sugar level is assessed from the glucose content in the body and from the patient's age. The main machine learning algorithms are Support Vector Machine (SVM), Naive Bayes (NB), K-Nearest Neighbor (KNN), and Decision Tree (DT). In the existing system, Naive Bayes reaches an accuracy of 66% and Decision Tree an accuracy of 70 to 71%. These accuracy levels are not in an acceptable range. With XGBoost classifiers, however, the reported accuracy is 74% for Naive Bayes and 89 to 90% for Decision Tree. The proposed system gives accuracy in an acceptable range and is therefore the preferred approach. A dataset of 729 patients is stored in MongoDB; the reports of 129 patients are used for prediction, and the remaining records are used for training. The model built from the training data is then used for prediction.
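
The split described at the end of the abstract (729 patient records, 129 held out for prediction, the remaining 600 used for training) can be sketched in a few lines. The abstract does not say how the 129 records were chosen; a seeded random shuffle is assumed here, and integer patient IDs stand in for the real records.

```python
import random

def holdout_split(records, n_test, seed=0):
    """Shuffle a copy of the records and return (train, test) lists."""
    rng = random.Random(seed)
    shuffled = records[:]
    rng.shuffle(shuffled)
    return shuffled[n_test:], shuffled[:n_test]

# Hypothetical stand-in for the 729 patient records.
patient_ids = list(range(729))
train, test = holdout_split(patient_ids, n_test=129)
print(len(train), len(test))
```

Keeping the two lists disjoint matters because accuracy measured on records the model has already seen during training would overstate its performance.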

Detection of Cardiac Arrhythmia using Machine Learning Algorithms

International Journal of Recent Technology and Engineering - 2, DOI 10.35940/ijrte.d4249.118419, 2019, Vol 8 (4), pp. 11704-11707

Keyword(s): Machine Learning, Cardiac Arrhythmia, Sinus Node, Research Work, Heart Rhythm, The Body, Machine Learning Algorithms, Machine Learning Techniques, Medical Attention, Advantages And Disadvantages

Cardiac arrhythmia is a condition in which a person suffers from an abnormal heart rhythm. It is caused by malfunctioning of the electrical impulses that coordinate the heartbeat. When this happens, the heart beats too slowly, too fast, or, more precisely, irregularly. The rhythm of the heart is controlled by a major node called the sinus node, located at the top of the heart, which triggers the electrical pulses that make the heart beat and pump blood to the body. Symptoms of cardiac arrhythmia include fainting, unconsciousness, shortness of breath, and unexpected functioning of the heart. It can lead to death within minutes if medical attention is not provided. To diagnose it, doctors need to study heart recordings and accurately evaluate heartbeats from different parts of the body. This evaluation takes a lot of time, so, building on the research contributed in this field, we propose a different approach. In this paper, we compare machine learning techniques and algorithms proposed by different authors, examine the advantages and disadvantages of those systems, and aim to introduce a new system in place of the existing ones, all of which use the same ECG recordings from the MIT-BIH database. Our initial research found that phonocardiogram (PCG) recordings provide higher fidelity and accuracy than ECG recordings. In the initial stage of the work, we take a PCG recordings dataset, convert it to spectrogram images, and apply a convolutional neural network to classify each heartbeat as normal or abnormal.

Rotor Unbalance Kind and Severity Identification by Current Signature Analysis with Adaptative Update to Multiclass Machine Learning Algorithms

Studies in Engineering and Technology, DOI 10.11114/set.v8i1.5213, 2021, Vol 8 (1), pp. 28

Author(s): S. L. Ávila, H. M. Schaberle, S. Youssef, F. S. Pacheco, C. A. Penz

Keyword(s): Machine Learning, Machine Learning Algorithms, Training Data, Machine Learning Techniques, Support Vector, Signature Analysis, Data Set, Learning Techniques, Environmental Variations, Current Signature

The health of a rotating electric machine can be evaluated by monitoring electrical and mechanical parameters. The more information is available, the easier the diagnosis of the machine's operational condition becomes. We built a laboratory test bench to study rotor unbalance issues according to ISO standards. Using harmonic analysis of the electric stator current, this paper presents a comparison study among Support-Vector Machines, Decision Tree classifiers, and the One-vs-One strategy to identify the kind and severity of rotor unbalance, a nonlinear multiclass task. Moreover, we propose a methodology to update the classifier so that it deals better with changes produced by environmental variations and natural machinery usage. The adaptative update consists of updating the training data set with an amount of recent data while keeping the entire original historical data. It is relevant for engineering maintenance. Our results show that current signature analysis is appropriate for identifying the type and severity of the rotor unbalance problem. Moreover, we show that machine learning techniques can be effective for an industrial application.
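
One way to read the adaptative update described above is as a training set made of two parts: the original historical data, which is never discarded, and a batch of recent measurements that is refreshed over time. The sketch below illustrates that reading only. The class name, the bounded window for recent samples, and the sample format are assumptions of this example; the abstract does not specify how much recent data is added or whether older recent data is dropped.

```python
from collections import deque

class AdaptiveTrainingSet:
    """Historical samples are kept in full; recent samples live in a bounded window."""

    def __init__(self, historical, recent_capacity):
        self.historical = list(historical)
        # Assumption of this sketch: once the window is full, the oldest
        # recent sample is dropped when a new one arrives.
        self.recent = deque(maxlen=recent_capacity)

    def add_recent(self, sample):
        self.recent.append(sample)

    def training_data(self):
        """Data on which the classifier would be refitted after an update."""
        return self.historical + list(self.recent)

# Hypothetical (feature, label) samples.
historical = [("h1", 0), ("h2", 1), ("h3", 0)]
training_set = AdaptiveTrainingSet(historical, recent_capacity=2)
for sample in [("r1", 1), ("r2", 0), ("r3", 1)]:
    training_set.add_recent(sample)

print(training_set.training_data())
```

After each update, the classifier would be refitted on `training_data()`, so that it reflects both the original test-bench conditions and the machine's current behavior.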

EKMPRFG: Ensemble of KNN, Multilayer Perceptron and Random Forest using Grading for Android Malware Classification

International Journal of Recent Technology and Engineering - 2, DOI 10.35940/ijrte.e5866.018520, 2020, Vol 8 (5), pp. 3353-3360

Keyword(s): Machine Learning, Feature Selection, Standard Deviation, Principal Component, Machine Learning Algorithms, Machine Learning Techniques, Data Sets, Android Malware, Android Malware Detection, Significant Research

Android is the most popular operating system, with over 2.5 billion devices across the globe. The popularity of this OS has unfortunately made the devices, and the services they enable, vulnerable to numerous security threats. As a result, significant research is being done in the field of Android malware detection employing machine learning algorithms. Our current work emphasizes the possible use of machine learning techniques for the detection of malware on Android devices. The proposed EKMPRFG is applied to the classification of Android malware after a preprocessing phase. This phase involves a hybrid feature selection model that uses the proposed Standard Deviation of Standard Deviation of Ranks (SDSDR) and several built-in feature selection algorithms, such as Correlation-based Feature Selection (CFS), Classifier SubsetEval, Consistency SubsetEval, and Filtered SubsetEval, followed by Principal Component Analysis (PCA) for dimensionality reduction. The experimental results obtained on two data sets indicate that EKMPRFG outperforms existing works in terms of prediction accuracy and weighted F-measure.
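
The weighted F-measure used as an evaluation metric above is commonly defined as the per-class F1 score averaged with weights proportional to each class's number of true instances. The sketch below implements that common definition in plain Python, on the assumption that the paper uses it; the labels and predictions are invented for illustration.

```python
from collections import Counter

def weighted_f_measure(actual, predicted):
    """Per-class F1 averaged with weights equal to each class's share of instances."""
    support = Counter(actual)
    total = len(actual)
    score = 0.0
    for cls, count in support.items():
        tp = sum(a == cls and p == cls for a, p in zip(actual, predicted))
        fp = sum(a != cls and p == cls for a, p in zip(actual, predicted))
        fn = sum(a == cls and p != cls for a, p in zip(actual, predicted))
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class is never hit.
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        score += (count / total) * f1
    return score

# Hypothetical labels for ten apps.
actual = ["malware"] * 4 + ["benign"] * 6
predicted = ["malware", "malware", "malware", "benign",
             "benign", "benign", "benign", "benign", "benign", "malware"]

print(weighted_f_measure(actual, predicted))
```

In this example the per-class F1 scores are 0.75 for "malware" and about 0.83 for "benign", and the class sizes of 4 and 6 give a weighted value of 0.8.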
