scholarly journals CLASSIFICATION MODEL FOR BREAST CANCER MAMMOGRAMS

2022 ◽  
Vol 23 (1) ◽  
pp. 187-199
Author(s):  
Suzani Mohamad Samuri ◽  
Try Viananda Nova ◽  
Bahbibi Rahmatullah ◽  
Shir Li Wang ◽  
Z.T Al-Qaysi

Machine learning has been the topic of interest in research related to early detection of breast cancer based on mammogram images. In this study, we compare the performance results from three (3) types of machine learning techniques: 1) Naïve Bayes (NB), 2) Neural Network (NN) and 3) Support Vector Machine (SVM) with 2000 digital mammogram images to choose the best technique that could model the relationship between the features extracted and the state of the breast (‘Normal’ or ‘Cancer’). Grey Level Co-occurrence Matrix (GLCM) which represents the two dimensions of the level variation gray in the image is used in the feature extraction process. Six (6) attributes consist of contrast, variance, standard deviation, kurtosis, mean and smoothness were computed as feature extracted and used as the inputs for the classification process. The data has been randomized and the experiment has been repeated for ten (10) times to check for the consistencies of the performance of all techniques. 70% of the data were used as the training data and another 30% used as testing data. The result after ten (10) experiments show that, Support Vector Machine (SVM) gives the most consistent results in correctly classifying the state of the breast as ‘Normal’ or ‘Cancer’, with the accuracy of 99.4%, in training and 98.76% in testing. The SVM classification model has outperformed NN and NB model in the study, and it shows that SVM is a good choice for determining the state of the breast at the early stage. ABSTRAK: Pembelajaran mesin telah menjadi topik yang diminati dalam penyelidikan yang berkaitan dengan pengesanan awal kanser payudara berdasarkan imej mamogram. Dalam kajian ini, kami membandingkan hasil prestasi dari tiga (3) jenis teknik pembelajaran mesin: 1) Naïve Bayes (NB), 2) Neural Network (NN) dan 3) Support Vector Machine (SVM) dengan 2000 imej digital mammogram hingga teknik terbaik yang dapat memodelkan hubungan antara ciri yang diekstraksi dan keadaan payudara ('Normal' atau 'Cancer') dapat diperoleh. Grey Level Co-occurrence Matrix (GLCM) yang mewakili dua dimensi variasi tahap kelabu pada gambar digunakan dalam proses pengekstrakan ciri. Enam (6) atribut terdiri dari kontras, varians, sisihan piawai, kurtosis, min dan kehalusan dihitung sebagai fitur yang diekstrak dan digunakan sebagai input untuk proses klasifikasi. Eksperimen telah diulang selama sepuluh (10) kali untuk memeriksa kesesuaian prestasi semua teknik. 70% data digunakan sebagai data latihan dan 30% lagi digunakan sebagai data ujian. Hasil setelah sepuluh (10) eksperimen menunjukkan bahawa, Support Vector Machine (SVM) memberikan hasil yang paling konsisten dalam mengklasifikasikan keadaan payudara dengan betul sebagai 'Normal' atau 'Kanser', dengan akurasi 99.4%, dalam latihan dan 98.76% dalam ujian. Model klasifikasi SVM telah mengungguli model NN dan NB dalam kajian ini, dan ia menunjukkan bahawa SVM adalah pilihan yang baik untuk menentukan keadaan payudara pada peringkat awal.

2020 ◽  
Vol 9 (2) ◽  
pp. 25-44
Author(s):  
Usha N. ◽  
Sriraam N. ◽  
Kavya N. ◽  
Bharathi Hiremath ◽  
Anupama K Pujar ◽  
...  

Breast cancer is one among the most common cancers in women. The early detection of breast cancer reduces the risk of death. Mammograms are an efficient breast imaging technique for breast cancer screening. Computer aided diagnosis (CAD) systems reduce manual errors and helps radiologists to analyze the mammogram images. The mammogram images are typically in two views, cranial-caudal (CC) and medio lateral oblique (MLO) views. MLO contains pectoral muscles (chest muscles) at the upper right or left corner of the image. In this study, it was removed by using a semi-automated method. All the normal and abnormal images were filtered and enhanced to improve the quality. GLCM (Gray Level Co-occurrence Matrix) texture features were extracted and analyzed by changing the number of features in a feature set. Linear Support Vector Machine (LSVM) was used as classifier. The classification accuracy was improved as the number of features in GLCM feature set increases. Simulation results show an overall classification accuracy of 96.7% with 19 GLCM features using SVM classifiers.


2021 ◽  
Vol 11 (1) ◽  
pp. 15-24
Author(s):  
Dequan Guo ◽  
Gexiang Zhang ◽  
Hui Peng ◽  
Jianying Yuan ◽  
Prithwineel Paul ◽  
...  

In recent years, diseases of cardiovascular and cerebrovascular have attracted much attention due to main causes in death in human beings. To reduce mortality, there are lots of efforts which are focused on early diagnosis and prevention. It is an important reference index for cardiovascular diseases through the endovascular membrane in carotid artery by medical ultrasound images. The paper proposes a method which finds the region of interest (ROI) by convolutional neural network, segments and measures intima-media membrane mainly using support vector machine (SVM). Essentially, the task of detecting the membrane is one target detection problem. This paper adopts the strategy, named Yon Only Look Once (YOLO), a new detection algorithm, and follows the convolution neural network algorithm based on end-to-end training. Firstly, sufficient samples are extracted according to certain characteristics in the special region. It can be trained by the SVM classification model. Then the ROI is processed and all the pixels are classified into boundary points and non-boundary points through the classification model. Thirdly, the boundary points are selected to obtain the accurate boundary and calculate the intima-media thickness (IMT). In experiments, two hundred ultrasound images are tested, and the results verify that our algorithm is consistent with the results by ground truth (GT). The detection speed of the algorithm in this paper is in real time, and it has high generalization characteristics. The algorithm computes the intima-media thickness in ultrasound images accurately and quickly with 95% consistence to ground truth.


2020 ◽  
Vol 7 (1) ◽  
pp. 53
Author(s):  
Derisma Derisma ◽  
Fajri Febrian

Abstrak: Kanker payudara merupakan jenis kanker yang sering ditemukan oleh kebanyakan wanita. Di Indonesia Kanker payudara menempati urutan pertama pada pasien rawat inap di seluruh rumah sakit. Tujuan dari penelitian ini adalah melakukan diagnosis penyakit kanker payudara berbasis komputasi yang dapat menghasilkan bagaimana kondisi kanker seseorang berdasarkan akurasi algoritma. Penelitian ini menggunakan pemrograman orange python dan dataset Wisconsin Breast Cancer untuk pemodelan klasifikasi kanker payudara. Metode data mining yang diterapkan yaitu Neural Network, Support Vector Machine, dan Naive Bayes. Dalam penelitian ini didapat algoritma klasifikasi terbaik yaitu algoritma Kernel SVM dengan tingkat akurasi sebesar  98.9 % dan algoritma terendah yaitu Naive Bayes senilai 96.1 %.   Kata kunci: kanker payudara, neural network, support vector machine, naive bayes   Abstract: Breast cancer is a type of cancer that mostly found in many women. In Indonesia, breast cancer ranks first in hospitalized patients at every hospital. This study aimed to conduct a computation-based diagnose of breast cancer disease that could produce the state of cancer of an individual based on the accuracy of algorithm. This study used python orange programming and Wisconsin Breast Cancer dataset for a modeling and application of breast cancer classification. The data mining methods that were applied in this study were Neural Network, Support Vector Machine, dan Naive Bayes. In this study, Kernel SVM’s algorithm was the best classification algorithm of breast cancer disease with 98.9 % accuracy rate and Naïve Beyes was the lowest with 96.1 % of accuracy rate.   Keywords: breast cancer, neural network, support vector machine, naive bayes


2020 ◽  
Author(s):  
V. Vijayasarveswari ◽  
A.M. Andrew ◽  
M. Jusoh ◽  
T. Sabapathy ◽  
R.A.A. Raof ◽  
...  

AbstractBreast cancer is the most common cancer among women and it is one of the main causes of death for women worldwide. To attain an optimum medical treatment for breast cancer, an early breast cancer detection is crucial. This paper proposes a multistage feature selection method that extracts statistically significant features for breast cancer size detection using proposed data normalization techniques. Ultra-wideband (UWB) signals, controlled using microcontroller are transmitted via an antenna from one end of the breast phantom and are received on the other end. These ultra-wideband analogue signals are represented in both time and frequency domain. The preprocessed digital data is passed to the proposed multi-stage feature selection algorithm. This algorithm has four selection stages. It comprises of data normalization methods, feature extraction, data dimensional reduction and feature fusion. The output data is fused together to form the proposed datasets, namely, 8-HybridFeature, 9-HybridFeature and 10-HybridFeature datasets. The classification performance of these datasets is tested using the Support Vector Machine, Probabilistic Neural Network and Naïve Bayes classifiers for breast cancer size classification. The research findings indicate that the 8-HybridFeature dataset performs better in comparison to the other two datasets. For the 8-HybridFeature dataset, the Naïve Bayes classifier (91.98%) outperformed the Support Vector Machine (90.44%) and Probabilistic Neural Network (80.05%) classifiers in terms of classification accuracy. The finalized method is tested and visualized in the MATLAB based 2D and 3D environment.


2020 ◽  
Vol 3 (1) ◽  
pp. 46-51
Author(s):  
Febri Liantoni ◽  
Agus Santoso

In this era to recognize breast tumors can be based on mammogram images. This method will expedite the process of recognition and classification of breast cancer. This research was conducted classification techniques of breast cancer using mammogram images. The proposed model targets classification studies for cases of malignant, and benign cancer. The research consisted of five main stages, preprocessing, histogram equalization, convolution, feature extraction, and classification. For preprocessing cropping the image using region of interest (ROI), for convolution, median filter and histogram equalization are used to improve image quality. Feature extraction using Gray-Level Co-Occurrence Matrix (GLCM) with 5 features, entropy, correlation, contrast, homogeneity, and variance. The final step is the classification using Radial Basis Function Neural Network (RBFNN) and Support Vector Machine (SVM). Based on the hypotheses that have been tested and discussed, the accuracy for RBFNN is 86.27%, while the accuracy for SVM is 84.31%. This shows that the RBFNN method is better than SVM in distinguishing types of breast cancer. These results prove the process of improving image construction using histogram equalization and the median filter is useful in the classification process.


2021 ◽  
Vol 11 (22) ◽  
pp. 10682
Author(s):  
Pham-The Hien ◽  
Ic-Pyo Hong

Wall-thinning in building structures due to corrosion and surface erosion occurs due to the severe operating conditions and the changing of the surrounding environment, or it can result from poor workmanship and a lack of systematic monitoring during construction. Hence, the continuous monitoring of structures plays an important role in decreasing unexpected accidents. In this paper, a novel method based on the deep neural network and support vector machine approaches is investigated to build up a thickness classification model by incorporating different input features, including the dielectric constants of the material under test, which are extracted from the scattering parameters proceeded by the National Institute of Standards and Technology iterative method. The attained classification results from both machine learning algorithms are then compared and show that both of the models have a good prediction ability. While the deep neural network is the better solution with a large amount of data, the support vector machine is the more appropriate solution when employing small dataset. It can be stated that the proposed method is able to support systematic monitoring as it can help to improve the accuracy of the prediction of material thickness.


Sign in / Sign up

Export Citation Format

Share Document