scholarly journals C3IMD : An Efficient Class-Based Clustering Classifier for Im-putation Intelligent Medical Data

2018 ◽  
Vol 7 (2.27) ◽  
pp. 255
Author(s):  
P. Premalatha ◽  
S Subasree ◽  
N K Sakthivel

The fast evolution in medical application yields to abundance of huge amount of data in volume and velocity.  Due to this heterogeneous medical data generation from clinical trials, its typically not free from missing values.  Previously introduced imputation techniques don’t discourse the high spatiality problems and application of distance function that even have curse on high spatiality problem. Thus, there’s a necessity an Efficient and Accurate technique to overcome this problem in Medical Data Analysis. To address the above mentioned issues, this research work proposed an efficient Class-Based Clustering Classifier for Imputation Intelligent Medical Data (C3IMD).  This work was implemented in Bio Weka and studied thoroughly. To improve the classification and prediction accuracy, missing data in Medical Data Sets were filled efficiently with the help of proposed Cluster-Classifier Model. The experiments are repeated with various datasets and results are evaluated and compared with existing classifiers WPT-DELM and SVM-DELM. From the results obtained, it was revealed that the proposed Class-Based Clustering Classifier for Imputation Intelligent Medical Data (C3IMD) is outperforming both the existing models in terms of Classification Accuracy, Sensitivity, Specificity and FScore.  

2018 ◽  
Vol 7 (4.6) ◽  
pp. 138
Author(s):  
Shahebaz Ahmed Khan ◽  
Dr. Seemakurthi ◽  
Dr. Akhil Jabbar

Artificial Neural Networks (ANN) techniques have the important concepts those can be used in the present scenario of the medical world. It has made the medical field to formulate easy steps to detect and predict the diseases like diabetes, thrombocytopenia, heart diseases, brain tumor, cancer etc. The classification methods available in the data mining theories and ANN gradually help to predict the data for the future analysis by building the classification models. In this paper, the results and the research work carried out on diabetic medical data using the artificial neural network algorithms like multilevel perception and its application over such data so as to predict the      diseases are discussed. The rules developed will be helpful to detect the co-disease in the diabetic patients and we have ranked them as per the final classifier for prediction. The proposed classification algorithm has accurately predicted the data with and without feature subset selection.  


The demand for data mining is now unavoidable in the medical industry due to its various applications and uses in predicting the diseases at the early stage. The methods available in the data mining theories are easy to extract the useful patterns and speed to recognize the task based outcomes. In data mining the classification models are really useful in building the classes for the medical data sets for future analysis in an accurate way. Besides these facilities, Association rules in data mining are a promising technique to find hidden patterns in a medical data set and have been successfully applied with market basket data, census data and financial data. Apriori algorithm, is considered to be a classic algorithm, is useful in mining frequent item sets on a database containing a large number of transactions and it also predicts the relevant association rules. Association rules capture the relationship of items that are present in data sets and when the data set contains continuous attributes, the existing algorithms may not work due to this, discretization can be applied to the association rules in order to find the relation between various patterns in data set. In this paper of our research, using Discretized Apriori the research work is done to predict the by-disease in people who are found with diabetic syndrome; also the rules extracted are analyzed. In the discretization step, numerical data is discretized and fed to the Apriori algorithm for better association rules to predict the diseases.


2018 ◽  
Vol 7 (2) ◽  
pp. 7 ◽  
Author(s):  
S Subasree ◽  
N P Gopalan ◽  
N K Sakthivel

Microarray based Cancer Pattern Classification is one of the popular techniques in Bioinformatics Research. This Research Work is noticed that for studying the expression levels through the Gene Expression profiling experiments, thousands of Genes have to be simultaneously studied to understand the patterns of the Gene Expression or Cancer Pattern. This research work proposed an efficient Cancer Pattern Clas-sifier called An Enhanced Multi-Objective Pswarm (EMOPS) and it is studied thoroughly in terms of Memory Utilization, Execution Time (Processing Time), Sensitivity, Specificity, Classification Accuracy and FScore. The results were compared with the recently proposed classifiers namely Hybrid Ant Bee Algorithm (HABA), Kernelized Fuzzy Rough Set Based Semi Supervised Support Vector Machine (KFRS-S3VM) and Multi-objective Particle Swarm Optimization (MPSO). For analyzing the performances of the proposed model, this work considered a few cancer patterns namely Bladder, Breast, Colon, Endometrial, Kidney, Leukemia, Lung, Melanoma, Mom-Hodgkin, Pancreatic, Prostate and Thyroid. From our experimental results, it was noticed that the proposed model outperforms the identified three classifiers in terms of Memory Utilization, Execution Time (Processing Time), Sensitivity, Specificity, Classification Accuracy and FScore. To improve the performance of the system further in term of Processing Time, the proposed model Enhanced Multi-Objective Pswarm (EMOPS) is implemented under Parallel Framework and evaluated. That is the model is tested with Two, Four, Eight and Sixteen Parallel Processors and from the results, it is established that the Processing Time decreases considerably which will improve the performance of the Proposed Model.


2020 ◽  
Vol 39 (3) ◽  
pp. 4227-4243
Author(s):  
Fatma M. Najib ◽  
Rasha M. Ismail ◽  
Nagwa L. Badr ◽  
Tarek F. Gharib

Many recent applications such as sensor networks generate continuous and time varying data streams that are often gathered from multiple data sources with some incompleteness and high dimensionality. Clustering such incomplete high dimensional streaming data faces four constraints which are 1) data incompleteness, 2) high dimensionality of data, 3) data distribution, 4) data streams’ continuous nature. Thus, in this paper, we propose the Subspace clustering for Incomplete High dimensional Data streams (SIHD) framework that overcomes the above clustering issues. The proposed SIHD provides continuous missing values imputation for incomplete streams based on the corresponding nearest-neighbors’ intervals. An adaptive subspace clustering mechanism is proposed to deal with such incomplete high dimensional data streams. Our experimental results using two different data sets prove the efficiency of the proposed SIHD framework in clustering such incomplete high dimensional data streams in terms of accuracy, precision, sensitivity, specificity, and F-score compared to five algorithms GFCM, GBDC-P2P, DS, Ensemble, and DMSC. The proposed SIHD improved: 1) the accuracy on average over the five algorithms in the same mentioned order by 11.3%, 10.8%, 6.5%, 4.1%, and 3.6%, 2) the precision by 15%, 10.6%, 6.4%, 4%, and 3.5%, 3) the sensitivity by 16.6%, 10.6%, 5.8%, 4.2%, and 3.6%, 4) the specificity by 16.8%, 10.9%, 6.5%, 4%, and 3.5%, 5) the F-score by 16.6%, 10.7%, 6.6%, 4.1%, and 3.6%.


The demand for data mining is now unavoidable in the medical industry due to its various applications and uses in predicting the diseases at the early stage. The methods available in the data mining theories are easy to extract the useful patterns and speed to recognize the task based outcomes. In data mining the classification models are really useful in building the classes for the medical data sets for future analysis in an accurate way. Besides these facilities, Association rules in data mining are a promising technique to find hidden patterns in a medical data set and have been successfully applied with market basket data, census data and financial data. Apriori algorithm, is considered to be a classic algorithm, is useful in mining frequent item sets on a database containing a large number of transactions and it also predicts the relevant association rules. Association rules capture the relationship of items that are present in data sets and when the data set contains continuous attributes, the existing algorithms may not work due to this, discretization can be applied to the association rules in order to find the relation between various patterns in data set. In this paper of our research, using Discretized Apriori the research work is done to predict the by-disease in people who are found with diabetic syndrome; also the rules extracted are analyzed. In the discretization step, numerical data is discretized and fed to the Apriori algorithm for better association rules to predict the diseases.


Author(s):  
Khodke harish Eknath ◽  
Yadav S K ◽  
Kyatanavar D N

Information mining frameworks are exhaustively used in coronary affliction for affirmation and figure. As heart condition is that the essential clarification for death for individuals, recognizing confirmation . The work proposed is inductive type and needs deep analysis of the data to ensure the right predictions on the data sets provided. A sample dataset of patients for heart disease will be collected from repository. It involves the steps and procedure. The proposed research work can be carried out step by step to conclude it with the accurate results.


Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1573
Author(s):  
Loris Nanni ◽  
Giovanni Minchio ◽  
Sheryl Brahnam ◽  
Gianluca Maguolo ◽  
Alessandra Lumini

Traditionally, classifiers are trained to predict patterns within a feature space. The image classification system presented here trains classifiers to predict patterns within a vector space by combining the dissimilarity spaces generated by a large set of Siamese Neural Networks (SNNs). A set of centroids from the patterns in the training data sets is calculated with supervised k-means clustering. The centroids are used to generate the dissimilarity space via the Siamese networks. The vector space descriptors are extracted by projecting patterns onto the similarity spaces, and SVMs classify an image by its dissimilarity vector. The versatility of the proposed approach in image classification is demonstrated by evaluating the system on different types of images across two domains: two medical data sets and two animal audio data sets with vocalizations represented as images (spectrograms). Results show that the proposed system’s performance competes competitively against the best-performing methods in the literature, obtaining state-of-the-art performance on one of the medical data sets, and does so without ad-hoc optimization of the clustering methods on the tested data sets.


2020 ◽  
Vol 6 (2) ◽  
pp. 90-97
Author(s):  
Sagir Masanawa ◽  
Hamza Abubakar

In this paper, a hybrid intelligent system that consists of the sparse matrix approach incorporated in neural network learning model as a decision support tool for medical data classification is presented. The main objective of this research is to develop an effective intelligent system that can be used by medical practitioners to accelerate diagnosis and treatment processes. The sparse matrix approach incorporated in neural network learning algorithm for scalability, minimize higher memory storage capacity usage, enhancing implementation time and speed up the analysis of the medical data classification problem. The hybrid intelligent system aims to exploit the advantages of the constituent models and, at the same time, alleviate their limitations. The proposed intelligent classification system maximizes the intelligently classification of medical data and minimizes the number of trends inaccurately identified. To evaluate the effectiveness of the hybrid intelligent system, three benchmark medical data sets, viz., Hepatitis, SPECT Heart and Cleveland Heart from the UCI Repository of Machine Learning, are used for evaluation. A number of useful performance metrics in medical applications which include accuracy, sensitivity, specificity. The results were analyzed and compared with those from other methods published in the literature. The experimental outcomes positively demonstrate that the hybrid intelligent system was effective in undertaking medical data classification tasks.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Gangadhar Ch ◽  
S. Jana ◽  
Sankararao Majji ◽  
Prathyusha Kuncha ◽  
Fantin Irudaya Raj E. ◽  
...  

Purpose For the first time in a decade, a new form of pneumonia virus, coronavirus, COVID-19, appeared in Wuhan, China. To date, it has affected millions of people, killed thousands and resulted in thousands of deaths around the world. To stop the spread of this virus, isolate the infected people. Computed tomography (CT) imaging is very accurate in revealing the details of the lungs and allows oncologists to detect COVID. However, the analysis of CT scans, which can include hundreds of images, may cause delays in hospitals. The use of artificial intelligence (AI) in radiology could help to COVID-19-positive cancer in this manner is the main purpose of the work. Design/methodology/approach CT scans are a medical imaging procedure that gives a three-dimensional (3D) representation of the lungs for clinical purposes. The volumetric 3D data sets can be regarded as axial, coronal and transverse data sets. By using AI, we can diagnose the virus presence. Findings The paper discusses the use of an AI for COVID-19, and CT classification issue and vaccination details of COVID-19 have been detailed in this paper. Originality/value Originality of the work is, all the data can be collected genuinely and did research work doneown methodology.


2021 ◽  
pp. 1-15
Author(s):  
Zhaozhao Xu ◽  
Derong Shen ◽  
Yue Kou ◽  
Tiezheng Nie

Due to high-dimensional feature and strong correlation of features, the classification accuracy of medical data is not as good enough as expected. feature selection is a common algorithm to solve this problem, and selects effective features by reducing the dimensionality of high-dimensional data. However, traditional feature selection algorithms have the blindness of threshold setting and the search algorithms are liable to fall into a local optimal solution. Based on it, this paper proposes a hybrid feature selection algorithm combining ReliefF and Particle swarm optimization. The algorithm is mainly divided into three parts: Firstly, the ReliefF is used to calculate the feature weight, and the features are ranked by the weight. Then ranking feature is grouped according to the density equalization, where the density of features in each group is the same. Finally, the Particle Swarm Optimization algorithm is used to search the ranking feature groups, and the feature selection is performed according to a new fitness function. Experimental results show that the random forest has the highest classification accuracy on the features selected. More importantly, it has the least number of features. In addition, experimental results on 2 medical datasets show that the average accuracy of random forest reaches 90.20%, which proves that the hybrid algorithm has a certain application value.


Sign in / Sign up

Export Citation Format

Share Document