Big Data Based Diabetes and Heart Disease Prediction System by Employing Supervised Learning Algorithm
Healthcare systems generate bytes and bytes of data and the data growth is exponential. The voluminous data can be analysed effectively, only when the data organization is efficient. Additionally, data retrieval must also be made simpler, such that the healthcare professional can compare and contrast the test sample with the database of health records. This makes it possible to achieve better disease prediction and this work presents a big data based disease prediction system with the help of supervised learning. The proposed approach clusters the related health records, based on every medical attribute followed by which the disease is predicted by SVM classifier. The performance of the proposed disease prediction system is observed to be satisfactory in terms of accuracy, precision, recall, F-measure, while consuming reasonable period of time.