Varіatіons іn the predіctіve effіcіency of soіl maps dependіng on the methods of constructіng traіnіng samples of predіcatіve algorіthms

2017 ◽  
Vol 28 (3-4) ◽  
pp. 55-71
Author(s):  
V. R. Cherlіnka

The maіn objectіve was to study the іnfluence of the traіnіng dataset on the qualіtatіve characterіstіcs of sіmulatіve soіl maps, whіch are obtaіned through sіmulatіon usіng a typіcal set of materіals that can be potentіally avaіlable for the soіl scіentіst іn modern Ukraіnіan realіtіes. Achіevement of thіs goal was achіeved by solvіng a number of the followіng tasks: a) dіgіtіzіng of cartographіc materіals; b) creatіng DEM wіth a resolutіon equal to 10 m; c) analysіs of dіgіtal elevatіon models and extractіon of land surface parameters; d) generatіon of traіnіng datasets accordіng to the descrіbed methodologіcal approaches; e) creatіon sіmulatіon models of soіl-cover іn R-statіstіc; g) analysіs of the obtaіned results and conclusіons regardіng the optіmal sіze of the traіnіng datasets for predіctіve modelіng of the soіl cover and іts duratіon. As an object was selected a fragment of the terrіtory of Ukraіne (4200×4200 m) wіthіn the lіmіts of Glybotsky dіstrіct of the Chernіvtsі regіon, confіned to the Prut-Sіret іnterfluve (North Bukovyna) wіth contrast geomorphologіcal condіtіons. Thіs area has dіfferent admіnіstratіve subordіnatіon and economіc use but іs covered wіth soіl cartographіc materіals only by 49.43 %. For data processіng were used іnstrumental possіbіlіtіes of free software: geo- rectіfіcatіons of maps materіal – GІS Quantum, dіgіtalіzatіon – Easy Trace, preparatіon of maps morphometrіc parameters – GRASS GІS and buіldіng sіmulatіve soіl maps – R, a language and envіronment for statіstіcal computіng. To create sіmulatіon models of soіl cover, a R-statіstіc scrіpt was wrіtten that іncludes a number of adaptatіons for solvіng set tasks and іmplements the dіfferent types of predіcatіve algorіthms such as: Multіnomіal Logіstіc Regressіon, Decіsіon Trees, Neural Networks, Random Forests, K-Nearest Neіghbors, Support Vector Machіnes and Bagged Trees. To assess the qualіty of the obtaіned models, the Cohen’s Kappa Іndex (?) was used whіch best represents the degree of complіance between the orіgіnal and the sіmulated data. As a benchmark, the usual medіal axes traіnіng dataset of was used. Other study optіons were: medіan-weіghted and randomіzed-weіghted samplіng. Thіs together wіth 7 predіcatіve algorіthms allowed to get 72 soіl sіmulatіons, the analysіs of whіch revealed quіte іnterestіng patterns. Models rankіng by іncreasіng the qualіty of the predіctіon by the kappa of the maіn data set shown, that the MLR algorіthm showed the worst results among others. Next іn ascendіng order are Neural Network, SVM, KNN, BGT, RF, DT. The last three algorіthms refer to the classіfіcatіon and theіr hіgh results іndіcate the greatest suіtabіlіty of such approaches іn sіmulatіon of soіl cover. The sample based on the weіghted medіan dіd not show strong advantages over others, as the results are quіte controversіal. Only іn the case of the neural network and the Bugget Trees the results of the medіan-weіghted sample predіctіon showed a better result vs a sіmple medіan sample and much worse than any varіants of randomіzed traіnіng data. Other algorіthms requіred a dіfferent number of randomіzed poіnts to cross the 90 % kappa: KNN – 25 %; BGT, RF and DT – 90 %. To achіeve 95 % kappa BGT algorіthm requіres 30% traіnіng poіnts of the total, RF – 25 % and DT – 20 %. Decіsіon Trees as a result turned out to be the most powerful algorіthm, whіch was able to sіmulate the dіstrіbutіon of soіl abnormalіtіes from kappa 97.13 % wіth 35 % saturatіon of the traіnіng sample wіth the orіgіnal data. Overall, DT shows a great dіfference between the approaches to selectіng traіnіng data: any medіan falls by 13 % іn front of a sіmple 5 % randomіzed-weіghted set of traіnіng cells and 22 % – about 35 % of the set.

2019 ◽  
Vol 2019 (02) ◽  
pp. 89-98
Author(s):  
Vijayakumar T

Predicting the category of tumors and the types of the cancer in its early stage remains as a very essential process to identify depth of the disease and treatment available for it. The neural network that functions similar to the human nervous system is widely utilized in the tumor investigation and the cancer prediction. The paper presents the analysis of the performance of the neural networks such as the, FNN (Feed Forward Neural Networks), RNN (Recurrent Neural Networks) and the CNN (Convolutional Neural Network) investigating the tumors and predicting the cancer. The results obtained by evaluating the neural networks on the breast cancer Wisconsin original data set shows that the CNN provides 43 % better prediction than the FNN and 25% better prediction than the RNN.


2013 ◽  
Vol 13 (01) ◽  
pp. 1350018 ◽  
Author(s):  
GUANGYING YANG

Electrocardiography (ECG) is a transthoracic interpretation of the electrical activity of the heart over a period of time, as detected by electrodes attached to the outer surface of the skin and recorded by a device external to the body. ECG signal classification is very important for the clinical detection of arrhythmia. This paper presents an application of an improved wavelet neural network structure to the classification of the ECG beats, because of the high precision and fast learning rate. Feature extraction method in this paper is wavelet transform. Our experimental data set is taken from the MIT-BIH arrhythmia database. The correct detection rate of QRS wave is 95% by testing the data of MIT-BIH database. The proposed methods are applied to a large number of ECG signals consisting of 600 training samples and 120 test samples from the MIT-BIH database. The samples equally represent six different ECG signal types, including normal beat, atrial premature beat, ventricular premature beat, left bundle branch block, right bundle branch block and paced beat. In comparison with pattern recognition methods of BP neural networks, RBF neural networks and Support Vector Machines (SVM), the results in this experiment prove that the wavelet neural network method has a better recognition rate when classifying electrocardiogram signals. The experimental results prove that supposed method in this paper is effective for arrhythmia pattern recognition field.


2019 ◽  
Author(s):  
Daniel Cleather

Musculoskeletal models have been used to estimate the muscle and joint contact forces expressed during movement. One limitation of this approach, however, is that such models are computationally demanding, which limits the possibility of using them for real-time feedback. One solution to this problem is to train a neural network to approximate the performance of the model, and then to use the neural network to give real-time feedback. In this study, neural networks were trained to approximate the FreeBody musculoskeletal model for jumping and landing tasks. The neural networks were better able to approximate jumping than landing, which was probably a result of the greater variability in the landing data set used in this study. In addition, a neural network that was based on a reduced set of inputs was also trained to approximate the outputs of FreeBody during a landing task. These results demonstrate the feasibility of using neural networks to approximate the results of musculoskeletal models in order to provide real-time feedback. In addition, these neural networks could be based upon a reduced set of kinematic variables taken from a 2-dimensional video record, making the implementation of mobile applications a possibility.


2020 ◽  
Vol 44 (6) ◽  
pp. 923-930
Author(s):  
I.A. Rodin ◽  
S.N. Khonina ◽  
P.G. Serafimovich ◽  
S.B. Popov

In this work, we carried out training and recognition of the types of aberrations corresponding to single Zernike functions, based on the intensity pattern of the point spread function (PSF) using convolutional neural networks. PSF intensity patterns in the focal plane were modeled using a fast Fourier transform algorithm. When training a neural network, the learning coefficient and the number of epochs for a dataset of a given size were selected empirically. The average prediction errors of the neural network for each type of aberration were obtained for a set of 15 Zernike functions from a data set of 15 thousand PSF pictures. As a result of training, for most types of aberrations, averaged absolute errors were obtained in the range of 0.012 – 0.015. However, determining the aberration coefficient (magnitude) requires additional research and data, for example, calculating the PSF in the extrafocal plane.


Author(s):  
Metin DEMIRTAS ◽  
Musa ALCI

The aim of this paper is to compare the neural network and fuzzy modeling approaches on a nonlinear system. We have taken Permanent Magnet Brushless Direct Current (PMBDC) motor data and have generated models using both approaches. The predictive performance of both methods was compared on the data set for model configurations.The paper describes the results of these tests and discusses the effects of changing model parameters on predictive and practical performance. Modeling sensitivity was used to compare for two methods. 


2013 ◽  
Vol 13 (4) ◽  
pp. 1077-1083 ◽  
Author(s):  
M. Akhoondzadeh

Abstract. In this paper, a number of classical and intelligent methods, including interquartile, autoregressive integrated moving average (ARIMA), artificial neural network (ANN) and support vector machine (SVM), have been proposed to quantify potential thermal anomalies around the time of the 11 August 2012 Varzeghan, Iran, earthquake (Mw = 6.4). The duration of the data set, which is comprised of Aqua-MODIS land surface temperature (LST) night-time snapshot images, is 62 days. In order to quantify variations of LST data obtained from satellite images, the air temperature (AT) data derived from the meteorological station close to the earthquake epicenter has been taken into account. For the models examined here, results indicate the following: (i) ARIMA models, which are the most widely used in the time series community for short-term forecasting, are quickly and easily implemented, and can efficiently act through linear solutions. (ii) A multilayer perceptron (MLP) feed-forward neural network can be a suitable non-parametric method to detect the anomalous changes of a non-linear time series such as variations of LST. (iii) Since SVMs are often used due to their many advantages for classification and regression tasks, it can be shown that, if the difference between the predicted value using the SVM method and the observed value exceeds the pre-defined threshold value, then the observed value could be regarded as an anomaly. (iv) ANN and SVM methods could be powerful tools in modeling complex phenomena such as earthquake precursor time series where we may not know what the underlying data generating process is. There is good agreement in the results obtained from the different methods for quantifying potential anomalies in a given LST time series. This paper indicates that the detection of the potential thermal anomalies derive credibility from the overall efficiencies and potentialities of the four integrated methods.


2006 ◽  
Vol 15 (03) ◽  
pp. 397-410 ◽  
Author(s):  
MANNES POEL ◽  
TACO EKKEL

Based on the hypothesis that the sound of the infant cry contains information on the infant's health status, research has been done on how to improve classification of neonate crying sounds into categories called 'normal' and 'abnormal' - the latter referring to some hypoxia-related disorder. Research in this field is hindered by lack of test cases and limited understanding of feature relevance. The research described here combines various ways of dealing with the small data set problem. First, feature pre-selection is done using sequential backwards elimination of possible combinations where the performance of the set of features is tested by a Probabilistic Neural Network which has the advantage of fast learning. Using these features a neural network committee, consisting of Radial Basis Function Neural Networks, was trained on the data, using bootstrapping. This construction yields a multi-classifier system with an overall classification performance of 85% on the so-called "All Cry Units" (ACU) data set, an increase of 34% with respect to the a priori probability of 51%. Several leave-1-out experiments for Linear Discriminant Analysis (LDA), Support Vector Machines (SVM) and Neural Networks (NN) have been conducted in order to compare the performance of the multi-classifier system.


Author(s):  
Evgenii E. Marushko ◽  
Alexander A. Doudkin ◽  
Xiangtao Zheng

The paper proposes an identification technique of objects on the Earth’s surface images based on combination of machine learning methods. Different variants of multi-layer convolutional neural networks and support vector machines are considered as original models. A hybrid convolutional neural network that combines features extracted by the neural network and experts is proposed. Optimal values of hyperparameters of the models are calculated by grid search methods using k-fold cross-validation. The possibility of improving the accuracy of identification based on the ensembles of these models is shown. Effectiveness of the proposed technique is demonstrated by the example of images obtained by synthetic aperture radar.


Sign in / Sign up

Export Citation Format

Share Document