TrCSVM: a novel approach for the classification of melanoma skin cancer using transfer learning

2020 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Lokesh Singh ◽  
Rekh Ram Janghel ◽  
Satya Prakash Sahu

PurposeThe study aims to cope with the problems confronted in the skin lesion datasets with less training data toward the classification of melanoma. The vital, challenging issue is the insufficiency of training data that occurred while classifying the lesions as melanoma and non-melanoma.Design/methodology/approachIn this work, a transfer learning (TL) framework Transfer Constituent Support Vector Machine (TrCSVM) is designed for melanoma classification based on feature-based domain adaptation (FBDA) leveraging the support vector machine (SVM) and Transfer AdaBoost (TrAdaBoost). The working of the framework is twofold: at first, SVM is utilized for domain adaptation for learning much transferrable representation between source and target domain. In the first phase, for homogeneous domain adaptation, it augments features by transforming the data from source and target (different but related) domains in a shared-subspace. In the second phase, for heterogeneous domain adaptation, it leverages knowledge by augmenting features from source to target (different and not related) domains to a shared-subspace. Second, TrAdaBoost is utilized to adjust the weights of wrongly classified data in the newly generated source and target datasets.FindingsThe experimental results empirically prove the superiority of TrCSVM than the state-of-the-art TL methods on less-sized datasets with an accuracy of 98.82%.Originality/valueExperiments are conducted on six skin lesion datasets and performance is compared based on accuracy, precision, sensitivity, and specificity. The effectiveness of TrCSVM is evaluated on ten other datasets towards testing its generalizing behavior. Its performance is also compared with two existing TL frameworks (TrResampling, TrAdaBoost) for the classification of melanoma.

2017 ◽  
Vol 9 (4) ◽  
pp. 416 ◽  
Author(s):  
Nelly Indriani Widiastuti ◽  
Ednawati Rainarli ◽  
Kania Evita Dewi

Classification is the process of grouping objects that have the same features or characteristics into several classes. The automatic documents classification use words frequency that appears on training data as features. The large number of documents cause the number of words that appears as a feature will increase. Therefore, summaries are chosen to reduce the number of words that used in classification. The classification uses multiclass Support Vector Machine (SVM) method. SVM was considered to have a good reputation in the classification. This research tests the effect of summary as selection features into documents classification. The summaries reduce text into 50%. A result obtained that the summaries did not affect value accuracy of classification of documents that use SVM. But, summaries improve the accuracy of Simple Logistic Classifier. The classification testing shows that the accuracy of Naïve Bayes Multinomial (NBM) better than SVM


Author(s):  
D. Wittich ◽  
F. Rottensteiner

<p><strong>Abstract.</strong> Domain adaptation (DA) can drastically decrease the amount of training data needed to obtain good classification models by leveraging available data from a source domain for the classification of a new (target) domains. In this paper, we address deep DA, i.e. DA with deep convolutional neural networks (CNN), a problem that has not been addressed frequently in remote sensing. We present a new method for semi-supervised DA for the task of pixel-based classification by a CNN. After proposing an encoder-decoder-based fully convolutional neural network (FCN), we adapt a method for adversarial discriminative DA to be applicable to the pixel-based classification of remotely sensed data based on this network. It tries to learn a feature representation that is domain invariant; domain-invariance is measured by a classifier’s incapability of predicting from which domain a sample was generated. We evaluate our FCN on the ISPRS labelling challenge, showing that it is close to the best-performing models. DA is evaluated on the basis of three domains. We compare different network configurations and perform the representation transfer at different layers of the network. We show that when using a proper layer for adaptation, our method achieves a positive transfer and thus an improved classification accuracy in the target domain for all evaluated combinations of source and target domains.</p>


2019 ◽  
Vol 30 (1) ◽  
pp. 105
Author(s):  
Mohammed Hussein ◽  
Amel H. Abbas

Abstract Agriculture has special importance in that it is a major source of food and clothing and is an important economic source for countries. Agriculture is affected by a variety of factors, biotic such as diseases resulting from bacteria, fungi, viruses and non-biotic such as water and temperature and other environmental factors. detection of these diseases requires people to expert in addition to a set of equipment and it is expensive in terms of time and money Therefore, the development of a computer based system that detection the diseases of plants is very helpful for farmers As well as to specialists in the field of plant protection. the proposed plant disease detection system consists of two phases, in the first phase we establish the knowledge base and this by introducing a set of training samples in a series of processing that include first use pre-processing techniques such cropping , resizing, fuzzy histogram equalization ,next extract a set of color and texture feature and used to great the knowledge base that used as training data for support vector machine classifier . In the second phase of the work we use the classifier that was trained using the knowledge base for detection and diagnosis of plant leaf diseases. To create the knowledge base we used 799 sample images and divided it by 80% training and 20% testing. We have use Three crops each yield three diseases in addition to the proper state of each crop .the accuracy of disease detection was 88.1% .


Author(s):  
Guokai Liu ◽  
Liang Gao ◽  
Weiming Shen ◽  
Andrew Kusiak

Abstract Condition monitoring and fault diagnosis are of great interest to the manufacturing industry. Deep learning algorithms have shown promising results in equipment prognostics and health management. However, their success has been hindered by excessive training time. In addition, deep learning algorithms face the domain adaptation dilemma encountered in dynamic application environments. The emerging concept of broad learning addresses the training time and the domain adaptation issue. In this paper, a broad transfer learning algorithm is proposed for the classification of bearing faults. Data of the same frequency is used to construct one- and two-dimensional training data sets to analyze performance of the broad transfer and deep learning algorithms. A broad learning algorithm contains two main layers, an augmented feature layer and a classification layer. The broad learning algorithm with a sparse auto-encoder is employed to extract features. The optimal solution of a redefined cost function with a limited sample size to ten per class in the target domain offers the classifier of broad learning domain adaptation capability. The effectiveness of the proposed algorithm has been demonstrated on a benchmark dataset. Computational experiments have demonstrated superior efficiency and accuracy of the proposed algorithm over the deep learning algorithms tested.


Author(s):  
VAHID BEHBOOD ◽  
JIE LU ◽  
GUANGQUAN ZHANG

Machine learning methods, such as neural network (NN) and support vector machine, assume that the training data and the test data are drawn from the same distribution. This assumption may not be satisfied in many real world applications, like long-term financial failure prediction, because the training and test data may each come from different time periods or domains. This paper proposes a novel algorithm known as fuzzy bridged refinement-based domain adaptation to solve the problem of long-term prediction. The algorithm utilizes the fuzzy system and similarity concepts to modify the target instances' labels which were initially predicted by a shift-unaware prediction model. The experiments are performed using three shift-unaware prediction models based on nine different settings including two main situations: (1) there is no labeled instance in the target domain; (2) there are a few labeled instances in the target domain. In these experiments bank failure financial data is used to validate the algorithm. The results demonstrate a significant improvement in the predictive accuracy, particularly in the second situation identified above.


2021 ◽  
Vol 11 (3) ◽  
pp. 948-954
Author(s):  
Xiang Chen ◽  
Lijun Xu ◽  
Ming Cao ◽  
Tinghua Zhang ◽  
Zhongan Shang ◽  
...  

At present, the demand for intelligentization of human-computer interaction systems (HCIS) has become increasingly prominent. Being able to recognize the emotions of users of interactive systems is a distinguishing feature of intelligent interactive systems. The intelligent HCIS can analyze the emotional changes of patients with depression, complete the interaction with the patients in a more appropriate manner, and the recognition results can assist family members or medical personnel to make response measures based on the patient’s emotional changes. Based on this background, this paper proposes a sentiment recognition method based on transfer support vector machines (TSVM) and EEG signals. The ER (ER) results based on this method are applied to HCIS. Such a HCIS is mainly used for the interaction of patients with depression. When a new field related to a certain field appears, if the new field data is relabeled, the sample is expensive, and it is very wasteful to discard all the old field data. The main innovation of this research is that the introduced classification model is TSVM. TSVM is a transfer learning strategy based on SVM. Transfer learning aims to solve related but different target domain problems by using a large amount of labeled source domain data. Therefore, the transfer support vector machine based on the transfer mechanism can use the small labeled data of the target domain and a large amount of old data in the related domain to build a high-quality classification model for the target domain, which can effectively improve the accuracy of classification. Comparing the classification results with other classification models, it can be concluded that TSVM can effectively improve the accuracy of ER in patients with depression. The HCIS based on the classification model has higher accuracy and better stability.


2021 ◽  
Vol 23 (08) ◽  
pp. 616-624
Author(s):  
Gaddam Akhil Reddy ◽  
◽  
Dr. B. Indira Reddy ◽  

The necessity for spam detection is particularly pertinent nowadays, as there is no quality control over social media, and users have the ability to distribute unverified material, therefore facilitating fraud and deceit. Spam detection can aid in the prevention of such fraud. This scenario has developed mostly as a result of the distribution of disparate, unconfirmed information via shopping websites, emails, and text messages (SMS). There are several ways of categorising and identifying spam. Each of them has certain advantages and disadvantages. The machine learning model “Support Vector Machine” is employed to detect spam in this case. SVM is a basic concept: the method proposes a line or hyperplane to classify the data. The model can categorise any type of text into a given category after being fed a set of labelled training data for each category.


CCIT Journal ◽  
2017 ◽  
Vol 10 (2) ◽  
pp. 197-206
Author(s):  
Atika Rahmawati ◽  
Aris Marjuni ◽  
Junta Zeniarja

Pilkada Serentak is a very important event for the future viability regions and countries. Through this election people can cast their vote and elect representatives of the people according to their choice. Public respond can be expressed through twitter social media. Using twitter social media sentiment analysis can then be made about the public response to the implementation of the election simultaneously. The classification process can be detected via text tweeted by twitter users. In this study, the classification of responses detected by text because it is easily obtained and applied. This study determined the classification of the response to the Indonesian language text and increase accuracy by using SVM.Tweet classification method used by the categorical approach is divided into two classes tweet basic level: positive and negative. Data collected from Indonesian twitter tweet as much as 3000. The labeling is not done manually but using clustering method that divides the 3000 data into two groups. Cluster 1 as a group of positive tweets and Cluster 2 as a negative group tweet.2700 for training data and 300 for the test data. The stage of pre-processing the data includetokenization, casenormalization, stop word detection, and stemming. The process of classification using Support Vector Machine (SVM). Accuracy of SVM showed the highest yield that is 91% compared to the k-means clustering with the results of 82%.


Algorithms ◽  
2019 ◽  
Vol 12 (12) ◽  
pp. 271 ◽  
Author(s):  
Yuntian Feng ◽  
Guoliang Wang ◽  
Zhipeng Liu ◽  
Runming Feng ◽  
Xiang Chen ◽  
...  

Aiming at the current problem that it is difficult to deal with an unknown radar emitter in the radar emitter identification process, we propose an unknown radar emitter identification method based on semi-supervised and transfer learning. Firstly, we construct the support vector machine (SVM) model based on transfer learning, using the information of labeled samples in the source domain to train in the target domain, which can solve the problem that the training data and the testing data do not satisfy the same-distribution hypothesis. Then, we design a semi-supervised co-training algorithm using the information of unlabeled samples to enhance the training effect, which can solve the problem that insufficient labeled data results in inadequate training of the classifier. Finally, we combine the transfer learning method with the semi-supervised learning method for the unknown radar emitter identification task. Simulation experiments show that the proposed method can effectively identify an unknown radar emitter and still maintain high identification accuracy within a certain measurement error range.


Sign in / Sign up

Export Citation Format

Share Document