scholarly journals Machine Learning for Classification of Postoperative Patient Status Using Standardized Medical Data

Author(s):  
Takanori Yamashita ◽  
Yoshifumi Wakata ◽  
Hideki Nakaguma ◽  
Yasunobu Nohara ◽  
Shinj Hato ◽  
...  

Medical data classification is an important and complex task. Due to the nature of data, the data is in different forms like text, numeric, images and sometimes combination of all. The goal of this paper is to provide a high-level introduction into practical machine learning for purposes of medical data classification. In this paper we use CNN-Auto encoder to extract data from the medical repository and made the classification of heterogeneous medical data. Here Auto encoder uses to get the prime features and CNN is there to extract detailed features. Combination of these two mechanisms are more suitable for medical data classification. Hybrid AE-CNN (auto encoder based Convolutional neural network). Here the performance of proposed mechanism with respect to baseline methods will be assessed. The performance results showed that the proposed mechanism performed well.


2022 ◽  
pp. 41-56
Author(s):  
Jeya Mala Dharmalingam ◽  
Pradeep Reynold A.

As there are several data sets available, this chapter gives insight on which regions of India have been heavily impacted during the first wave of COVID-19 and the classification of patient status using an ML-based data analytics algorithm. The chapter provides a greater insight on the background work and the reports generated based on the analytical results gathered from the data set. In this pandemic situation, such reports will be a great benefit to assess the history of occurrence and the current status of the COVID-19 situation in India.


2021 ◽  
Vol 39 (15_suppl) ◽  
pp. e13583-e13583
Author(s):  
Qingyuan Li ◽  
Ji He

e13583 Background: In the era of data explosion, precision classification of cancer samples based on multi-dimensional medical data provides more insights into disease mechanism and useful hints on clinical treatment associated with tissue of origin, recurrence tendency and prognostic of chemotherapy or immunotherapy. We developed an automatic workflow MLkit to select features from large-scale multi-dimensional medical data and conduct classification through various machine learning techniques. Methods: MLkit is an automatic and one-stop workflow for classification of cancer samples with four modules: preprocessing (missing data remove or imputation and feature standardization), feature selection (unsupervised multi-statistics and supervised multiple machine estimators with recursive feature elimination and cross-validation), modeling (hyper-parameter, performance evaluation and probability calibration) and prediction. Most of current machine learning algorithms were implemented in this workflow, including linear model (logistic regression, ridge regression and stochastic gradient descent), ensemble model (gradient boosting, random forest, xgboost, catboost, lightgmb and stacking), support vector kernel (linear and non-linear), naive Bayes, k-nearest neighbors and multi-layer perceptron neural network. To evaluate the performance of this workflow, we utilized it to fit a model used for prediction of tissue of origin based on 450K DNA methylation data of 2,210 samples from lung, kidney and breast cancer patients collected in TCGA. Results: MLkit performed well in the prediction of tissue of origin for independent validation sets of cancer patients with stable feature selection, automatic hyper-parameters and efficient probability calibration, in which the model achieved AUCs ranged from 0.85 to 0.96. In addition, we also utilized this workflow on extensive real world data and most of results showed superior accuracy and stable performance. Conclusions: MLkit facilitates automated and one-stop classification of cancer samples using machine learning algorithms. It can be operated with simple command line, making it accessible to a broad range of users. The well performance of this workflow based on multi-dimensional medical data can help to improve the discovery of tumor biomarker and optimize clinical follow-up and therapeutic treatment for cancer patients.


Author(s):  
Padmavathi .S ◽  
M. Chidambaram

Text classification has grown into more significant in managing and organizing the text data due to tremendous growth of online information. It does classification of documents in to fixed number of predefined categories. Rule based approach and Machine learning approach are the two ways of text classification. In rule based approach, classification of documents is done based on manually defined rules. In Machine learning based approach, classification rules or classifier are defined automatically using example documents. It has higher recall and quick process. This paper shows an investigation on text classification utilizing different machine learning techniques.


Author(s):  
Hyeuk Kim

Unsupervised learning in machine learning divides data into several groups. The observations in the same group have similar characteristics and the observations in the different groups have the different characteristics. In the paper, we classify data by partitioning around medoids which have some advantages over the k-means clustering. We apply it to baseball players in Korea Baseball League. We also apply the principal component analysis to data and draw the graph using two components for axis. We interpret the meaning of the clustering graphically through the procedure. The combination of the partitioning around medoids and the principal component analysis can be used to any other data and the approach makes us to figure out the characteristics easily.


Sign in / Sign up

Export Citation Format

Share Document