Applying Support Vector Machines to POS tagging of the Ainu Language

We describe our attempt to apply a state-of-the-art sequential tagger, SVMTool, to the task of automatic part-of-speech annotation of the Ainu language, a critically endangered language isolate spoken by the native inhabitants of northern Japan. Our experiments indicated that it performs better than the custom system proposed in previous research (POST-AL), especially when applied to out-of-domain data. The biggest advantage of the model trained using SVMTool over the POST-AL tagger is its ability to guess part-of-speech tags for out-of-vocabulary (OoV) words, with an accuracy of up to 63%.
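The OoV figure quoted above is an accuracy computed only over tokens that never appeared in the training data. A minimal sketch of that measurement follows; the function name `oov_accuracy` and the toy tokens and tags are hypothetical placeholders, not data or code from the paper.

```python
def oov_accuracy(train_vocab, tokens, gold_tags, predicted_tags):
    """Return tagging accuracy restricted to tokens absent from train_vocab."""
    # Keep only (gold, predicted) pairs whose token was never seen in training.
    oov_pairs = [
        (gold, pred)
        for token, gold, pred in zip(tokens, gold_tags, predicted_tags)
        if token not in train_vocab
    ]
    if not oov_pairs:
        return None  # no out-of-vocabulary tokens to score
    correct = sum(1 for gold, pred in oov_pairs if gold == pred)
    return correct / len(oov_pairs)


# Hypothetical toy data: "x", "y" and "z" are the OoV tokens.
train_vocab = {"a", "b", "c"}
tokens = ["a", "x", "y", "b", "z"]
gold = ["N", "V", "N", "V", "N"]
pred = ["N", "V", "V", "V", "N"]
score = oov_accuracy(train_vocab, tokens, gold, pred)
```

Reporting this number separately from overall accuracy shows how well a tagger generalises beyond its training lexicon.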

Contact Lens Classification by Using Segmented Lens Boundary Features

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v11.i3.pp1129-1135 ◽

2018 ◽

Vol 11 (3) ◽

pp. 1129

Author(s):

Nur Ariffin Mohd Zin ◽

Hishammuddin Asmuni ◽

Haza Nuzly Abdul Hamed ◽

Razib M. Othman ◽

Shahreen Kasim ◽

...

Keyword(s):

Support Vector Machines ◽

Contact Lens ◽

State Of The Art ◽

Classification Method ◽

Support Vector ◽

Local Descriptors ◽

Iris Image ◽

Vector Machines ◽

False Reject Rate ◽

Better Than

Recent studies have shown that wearing soft contact lenses may degrade performance, with an increase in the false reject rate. However, detecting the presence of a soft lens is a non-trivial task because its texture is almost indiscernible. In this work, we propose a classification method to identify the presence of a soft lens in an iris image. The method starts by segmenting the lens boundary on top of the sclera region. Local descriptors are then extracted from the segmented boundary as features, which are used to train a Support Vector Machine classifier. The method was tested on the Notre Dame Cosmetic Contact Lens 2013 database. Experiments showed that the proposed method performed better than state-of-the-art methods.
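The abstract does not say which local descriptors are used. As one possible illustration, the sketch below computes a local binary pattern (LBP) histogram over a grey-level patch; LBP is an assumption chosen for the example, and `lbp_histogram` is a hypothetical helper, not the authors' code. A feature vector of this kind could then be passed to an SVM classifier.

```python
def lbp_histogram(image):
    """Return a normalised 256-bin histogram of 8-neighbour LBP codes.

    `image` is a list of equal-length rows of grey levels. Border pixels
    are skipped because they lack a full 3x3 neighbourhood.
    """
    # Neighbour offsets, visited clockwise starting at the top-left pixel.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    hist = [0] * 256
    rows, cols = len(image), len(image[0])
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            centre = image[r][c]
            code = 0
            for bit, (dr, dc) in enumerate(offsets):
                # Set the bit when the neighbour is at least as bright as the centre.
                if image[r + dr][c + dc] >= centre:
                    code |= 1 << bit
            hist[code] += 1
    total = sum(hist)
    return [count / total for count in hist] if total else hist
```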

Linear Support Vector Machines for Prediction of Student Performance in School-Based Education

Mathematical Problems in Engineering ◽

10.1155/2020/4761468 ◽

2020 ◽

Vol 2020 ◽

pp. 1-7

Author(s):

Nalindren Naicker ◽

Timothy Adeliyi ◽

Jeanette Wing

Keyword(s):

Machine Learning ◽

Support Vector Machines ◽

Student Performance ◽

State Of The Art ◽

Learning Algorithms ◽

The State ◽

Machine Learning Algorithms ◽

Superior Performance ◽

Support Vector ◽

Vector Machines

Educational Data Mining (EDM) is a rich research field in computer science. Tools and techniques in EDM are useful for predicting student performance, which gives practitioners insights for developing appropriate intervention strategies to improve pass rates and increase retention. The performance of state-of-the-art machine learning classifiers depends heavily on the task at hand. Support vector machines have been used extensively in classification problems; however, the extant literature shows a gap in the application of linear support vector machines as a predictor of student performance. The aim of this study was to compare the performance of linear support vector machines with that of state-of-the-art classical machine learning algorithms in order to determine the algorithm that would improve prediction of student performance. In this quantitative study, an experimental research design was used. Experiments were set up using feature selection on a publicly available dataset of 1000 alpha-numeric student records. Linear support vector machines, benchmarked against ten categorical machine learning algorithms, showed superior performance in predicting student performance. The results showed that features such as race, gender, and lunch influence performance in mathematics, whilst access to lunch was the primary factor influencing reading and writing performance.
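For readers unfamiliar with the classifier being benchmarked, a linear support vector machine can be sketched as subgradient descent on the L2-regularised hinge loss. The code below is a minimal illustration under stated assumptions: the function names, hyperparameters, and toy numeric features are hypothetical, and it does not reproduce the study's tooling or dataset (whose categorical features would first need a numeric encoding).

```python
def train_linear_svm(X, y, lam=0.01, eta=0.1, epochs=100):
    """Fit weights w and bias b for labels in {-1, +1} by hinge-loss SGD."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        # A fixed visiting order keeps this sketch deterministic.
        for features, label in zip(X, y):
            score = sum(wj * xj for wj, xj in zip(w, features)) + b
            # The L2 penalty shrinks the weights on every step.
            w = [(1.0 - eta * lam) * wj for wj in w]
            if label * score < 1:
                # The sample violates the margin: step along the hinge subgradient.
                w = [wj + eta * label * xj for wj, xj in zip(w, features)]
                b += eta * label
    return w, b


def predict(w, b, features):
    """Return +1 or -1 depending on the side of the separating hyperplane."""
    score = sum(wj * xj for wj, xj in zip(w, features)) + b
    return 1 if score >= 0 else -1


# Hypothetical, linearly separable toy data.
X = [[2.0, 2.0], [3.0, 3.0], [-2.0, -2.0], [-3.0, -3.0]]
y = [1, 1, -1, -1]
w, b = train_linear_svm(X, y)
```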

Study on Prediction of Chaotic Time Series Using Least Squares Support Vector Machines

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.1061-1062.935 ◽

2014 ◽

Vol 1061-1062 ◽

pp. 935-938

Author(s):

Xin You Wang ◽

Guo Fei Gao ◽

Zhan Qu ◽

Hai Feng Pu

Keyword(s):

Time Series ◽

Support Vector Machine ◽

Support Vector Machines ◽

Least Squares ◽

Prediction Accuracy ◽

Chaotic Time Series ◽

Support Vector ◽

Vector Machines ◽

Online Prediction ◽

Better Than

This study examines the prediction of chaotic time series using the least squares support vector machine (LS-SVM), in comparison with traditional SVM variants. The results show that LS-SVM achieves better prediction accuracy than the traditional SVM and is more suitable for online time series prediction.
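For context, the standard textbook formulation of LS-SVM regression is given below; the abstract does not state which variant or kernel the authors used, so the symbols here ($\gamma$ for the regularisation constant, $K$ for the kernel, $\alpha$ and $b$ for the model parameters) are assumptions. LS-SVM replaces the inequality constraints of the traditional SVM with equalities and a squared error term, so training reduces to solving one linear system instead of a quadratic program.

```latex
% Primal problem: squared errors e_i with equality constraints
\min_{w,\,b,\,e}\ \frac{1}{2}\, w^{\top} w + \frac{\gamma}{2} \sum_{i=1}^{N} e_i^{2}
\quad \text{s.t.} \quad y_i = w^{\top} \varphi(x_i) + b + e_i, \qquad i = 1, \dots, N

% Dual solution: a single (N+1) x (N+1) linear system
\begin{bmatrix} 0 & \mathbf{1}^{\top} \\ \mathbf{1} & \Omega + \gamma^{-1} I \end{bmatrix}
\begin{bmatrix} b \\ \alpha \end{bmatrix}
=
\begin{bmatrix} 0 \\ y \end{bmatrix},
\qquad \Omega_{ij} = K(x_i, x_j)

% Prediction for a new input x
f(x) = \sum_{i=1}^{N} \alpha_i\, K(x, x_i) + b
```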

Submodular neural network is better than modular neural network and support vector machines for personal verification

Proceedings of the International Joint Conference on Neural Networks, 2003. ◽

10.1109/ijcnn.2003.1223741 ◽

2004 ◽

Author(s):

T. Nagano ◽

M. Hirahara ◽

H. Eguchi

Keyword(s):

Neural Network ◽

Support Vector Machines ◽

Support Vector ◽

Modular Neural Network ◽

Vector Machines ◽

Better Than

Embedded Feature Selection for Support Vector Machines: State-of-the-Art and Future Challenges

Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-642-25085-9_36 ◽

2011 ◽

pp. 304-311 ◽

Cited By ~ 3

Author(s):

Sebastián Maldonado ◽

Richard Weber

Keyword(s):

Feature Selection ◽

Support Vector Machines ◽

State Of The Art ◽

Support Vector ◽

Vector Machines ◽

Future Challenges ◽

Selection For

Earth remote sensing imagery classification using a multi-sensor super-resolution fusion algorithm

Computer Optics ◽

10.18287/2412-6179-co-735 ◽

2020 ◽

Vol 44 (4) ◽

pp. 627-635

Author(s):

A.M. Belov ◽

A.Y. Denisova

Keyword(s):

Remote Sensing ◽

Support Vector Machines ◽

Random Forest ◽

Classification Accuracy ◽

Support Vector ◽

Random Forest Algorithm ◽

Earth Remote Sensing ◽

Vector Machines ◽

Fused Image ◽

Better Than

Earth remote sensing data fusion is intended to produce images of higher quality than the original ones. However, the impact of fusion on further thematic processing remains an open question because fusion methods are mostly used to improve the visual data representation. This article addresses the effect of fusion that increases the spatial and spectral resolution of data on the thematic classification of images using various state-of-the-art classifiers and feature extraction methods. We use our own algorithm to perform multi-frame image fusion over optical remote sensing images with different spatial and spectral resolutions. For classification, we applied the support vector machines and Random Forest algorithms. For features, we used spectral channels, extended attribute profiles and local feature attribute profiles. An experimental study was carried out using model images of four imaging systems. The resulting image had a spatial resolution 2, 3, 4 and 5 times better than that of the original images of each imaging system, respectively. Our studies revealed that for the support vector machines method, fusion was inexpedient since excessive spatial details had a negative effect on the classification. For the Random Forest algorithm, the classification results of a fused image were more accurate than for the original low-resolution images in 90% of cases. For example, for images with the smallest difference in spatial resolution (2 times) from the fusion result, the classification accuracy of the fused image was on average 4% higher. In addition, the results obtained for the Random Forest algorithm with fusion were better than the results for the support vector machines method without fusion. It was also shown that the classification accuracy of a fused image using the Random Forest method could be increased by an average of 9% by using extended attribute profiles as features. Thus, when using data fusion, it is better to use the Random Forest classifier, whereas using fusion with the support vector machines method is not recommended.

Credit Scoring: A Review on Support Vector Machines and Metaheuristic Approaches

Advances in Operations Research ◽

10.1155/2019/1974794 ◽

2019 ◽

Vol 2019 ◽

pp. 1-30 ◽

Cited By ~ 8

Author(s):

R. Y. Goh ◽

L. S. Lee

Keyword(s):

Support Vector Machines ◽

State Of The Art ◽

Credit Scoring ◽

Future Research ◽

Support Vector ◽

Research Gaps ◽

Hybrid Modelling ◽

Vector Machines ◽

Assessment Procedures ◽

Credit Granting

Development of credit scoring models is important for financial institutions to identify defaulters and nondefaulters when making credit granting decisions. In recent years, artificial intelligence (AI) techniques have shown successful performance in credit scoring. Support Vector Machines and metaheuristic approaches have constantly received attention from researchers in establishing new credit models. In this paper, these two AI techniques are reviewed, with detailed discussions of credit scoring models built with both methods from 1997 to 2018. The discussions are based on two main aspects: model type with the issues addressed, and assessment procedures. Together with a compilation of past experimental results on common datasets, the review finds that hybrid modelling is the state-of-the-art approach for both methods. Some possible research gaps for future research are identified.

Support Vector Machines based Part of Speech Tagging for Nepali Text

International Journal of Computer Applications ◽

10.5120/12217-8374 ◽

2013 ◽

Vol 70 (24) ◽

pp. 38-42 ◽

Cited By ~ 2

Author(s):

Tej Bahadur Shahi ◽

Tank Nath Dhamala ◽

Bikash Balami

Keyword(s):

Support Vector Machines ◽

Support Vector ◽

Part Of Speech Tagging ◽

Part Of Speech ◽

Vector Machines ◽

Speech Tagging

A “Salt and Pepper” Noise Reduction Scheme for Digital Images Based on Support Vector Machines Classification and Regression

The Scientific World JOURNAL ◽

10.1155/2014/826405 ◽

2014 ◽

Vol 2014 ◽

pp. 1-15

Author(s):

Hilario Gómez-Moreno ◽

Pedro Gil-Jiménez ◽

Sergio Lafuente-Arroyo ◽

Roberto López-Sastre ◽

Saturnino Maldonado-Bascón

Keyword(s):

Support Vector Machines ◽

Noise Reduction ◽

Digital Images ◽

State Of The Art ◽

Noise Removal ◽

Support Vector ◽

Salt And Pepper Noise ◽

Vector Machines ◽

Classification And Regression ◽

Salt And Pepper

We present a new impulse noise removal technique based on Support Vector Machines (SVM). Both classification and regression were used to reduce the “salt and pepper” noise found in digital images. Classification enables identification of noisy pixels, while regression provides a means to determine reconstruction values. The training vectors necessary for the SVM were generated synthetically in order to maintain control over quality and complexity. A modified median filter based on a previous noise detection stage and a regression-based filter are presented and compared to other well-known state-of-the-art noise reduction algorithms. The results show that the filters proposed achieved good results, outperforming other state-of-the-art algorithms for low and medium noise ratios, and were comparable for very highly corrupted images.
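The idea of a median filter driven by a prior noise detection stage can be sketched as follows. This is a simplified illustration, not the authors' method: the SVM classifier is replaced by a plain extreme-value check (a pixel counts as noisy when it equals the assumed "pepper" value 0 or "salt" value 255), and the function name and 3x3 window are assumptions. Only pixels flagged as noisy are replaced, and the median is taken over the unflagged neighbours.

```python
from statistics import median


def denoise_salt_and_pepper(image, low=0, high=255):
    """Replace flagged impulse pixels with the median of their clean 3x3 neighbours."""
    rows, cols = len(image), len(image[0])

    def is_noisy(value):
        # Stand-in detector; the paper trains an SVM classifier for this step.
        return value == low or value == high

    output = [row[:] for row in image]
    for r in range(rows):
        for c in range(cols):
            if not is_noisy(image[r][c]):
                continue  # clean pixels are left untouched
            clean = [
                image[rr][cc]
                for rr in range(max(0, r - 1), min(rows, r + 2))
                for cc in range(max(0, c - 1), min(cols, c + 2))
                if not is_noisy(image[rr][cc])
            ]
            if clean:
                output[r][c] = int(median(clean))
    return output
```

Restricting replacement to flagged pixels avoids the blurring that a plain median filter applies to every pixel, which is the motivation for a separate detection stage.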

Text and metadata extraction from scanned Arabic documents using support vector machines

Journal of Information Science ◽

10.1177/0165551520961256 ◽

2020 ◽

pp. 016555152096125

Author(s):

Wenda Qin ◽

Randa Elanwar ◽

Margrit Betke

Keyword(s):

Support Vector Machines ◽

State Of The Art ◽

Support Vector ◽

Data Sets ◽

Layout Analysis ◽

Data Set ◽

Metadata Extraction ◽

Vector Machines ◽

Text Information ◽

Multiple Support Vector Machines

Text information in scanned documents becomes accessible only when extracted and interpreted by a text recognizer. For a recognizer to work successfully, it must have detailed location information about the regions of the document images that it is asked to analyse. It needs to focus on page regions with text, skipping non-text regions that include illustrations or photographs. However, text recognizers do not work as logical analyzers. Logical layout analysis automatically determines the function of a document text region, that is, it labels each region as a title, paragraph, caption, and so on, and is thus an essential part of a document understanding system. In the past, rule-based algorithms have been used to conduct logical layout analysis, using data sets of limited size. Here we instead focus on supervised learning methods for logical layout analysis. We describe LABA, a system based on multiple support vector machines that performs logical Layout Analysis of scanned Book pages in Arabic. The system detects the function of a text region based on the analysis of various image features and a voting mechanism. For a baseline comparison, we implemented an older but state-of-the-art neural network method. We evaluated LABA using a data set of scanned pages from illustrated Arabic books and obtained high recall and precision values. We also found that the F-measure of LABA is higher than that of the state-of-the-art method for five of the six tested classes.
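The abstract mentions a voting mechanism over multiple classifiers and a per-class F-measure, without giving details. The sketch below shows one common reading, simple majority voting followed by the standard F-measure (the harmonic mean of precision and recall); the function names and the toy votes are hypothetical, and LABA's actual voting rule may differ.

```python
from collections import Counter


def majority_vote(labels):
    """Return the most frequent label; ties go to the label seen first."""
    return Counter(labels).most_common(1)[0][0]


def f_measure(gold, predicted, target):
    """Return the F-measure of one class from gold and predicted labels."""
    tp = sum(1 for g, p in zip(gold, predicted) if g == target and p == target)
    fp = sum(1 for g, p in zip(gold, predicted) if g != target and p == target)
    fn = sum(1 for g, p in zip(gold, predicted) if g == target and p != target)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical votes from three classifiers for four text regions.
votes = [
    ["title", "title", "paragraph"],
    ["paragraph", "caption", "paragraph"],
    ["paragraph", "paragraph", "caption"],
    ["paragraph", "paragraph", "paragraph"],
]
gold = ["title", "paragraph", "caption", "paragraph"]
predicted = [majority_vote(v) for v in votes]
paragraph_f = f_measure(gold, predicted, "paragraph")
```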
