Prediction of lysine formylation sites using support vector machine based on the sample selection from majority classes and synthetic minority over-sampling techniques

Biochimie ◽  
2021 ◽  
Author(s):  
Md Sohrawordi ◽  
Md Ali Hossain
2015 ◽  
Vol 11 (4) ◽  
pp. 14 ◽  
Author(s):  
Pan Xin ◽  
Hongbin Sun

Advancements in remote sensing technology have led to improvements in the acquisition of land cover information. The extraction of accurate and timely knowledge about land cover from remote sensing imagery largely depends on the classification techniques used. Support vector machine has been receiving considerable attention as a promising method for classifying remote sensing imagery. However, the support vector machine learning process typically requires a large memory and significant computation time for treating a large sample set, in which some of the samples might be redundant and useless for the support vector machine model training. Therefore, higher-quality and fewer samples from the sample selection should be utilized for support vector machine-based remote sensing classification. A convex theory-based remote sensing sample selection algorithm for support vector machine classifiers is developed in this work. A Landsat-5 Thematic Mapper imagery acquired on August 31, 2009 (orbit number 113/27) is adopted in our experiments. The study area's land cover/use was divided into five categories. Using the region of interest tool, we select samples from the image of the study area, with each category consisting of 1000 independent pixels. Results show that for most cases, our method can achieve higher classification accuracy than random sample selection method.


Author(s):  
Meng Wang ◽  
Xian-Sheng Hua ◽  
Jinhui Tang ◽  
Guo-Jun Qi

This chapter introduces the application of active learning in video annotation. The insufficiency of training data is a major obstacle in learning-based video annotation. Active learning is a promising approach to dealing with this difficulty. It iteratively annotates a selected set of most informative samples, such that the obtained training set is more effective than that gathered randomly. The authors present a brief review of the typical active learning approaches. They categorize the sample selection strategies in these methods into five criteria, that is, risk reduction, uncertainty, positivity, density, and diversity. In particular, they introduce the Support Vector Machine (SVM)-based active learning scheme which has been widely applied. Afterwards, they analyze the deficiency of the existing active learning methods for video annotation, that is, in most of these methods the to-be-annotated concepts are treated equally without preference and only one modality is applied. To address these two issues, the authors introduce a multi-concept multi-modality active learning scheme. This scheme is able to better explore human labeling effort by considering both the learnabilities of different concepts and the potential of different modalities.


2020 ◽  
Author(s):  
V Vasilevska ◽  
K Schlaaf ◽  
H Dobrowolny ◽  
G Meyer-Lotz ◽  
HG Bernstein ◽  
...  

2019 ◽  
Vol 15 (2) ◽  
pp. 275-280
Author(s):  
Agus Setiyono ◽  
Hilman F Pardede

It is now common for a cellphone to receive spam messages. Great number of received messages making it difficult for human to classify those messages to Spam or no Spam.  One way to overcome this problem is to use Data Mining for automatic classifications. In this paper, we investigate various data mining techniques, named Support Vector Machine, Multinomial Naïve Bayes and Decision Tree for automatic spam detection. Our experimental results show that Support Vector Machine algorithm is the best algorithm over three evaluated algorithms. Support Vector Machine achieves 98.33%, while Multinomial Naïve Bayes achieves 98.13% and Decision Tree is at 97.10 % accuracy.


Sign in / Sign up

Export Citation Format

Share Document