scholarly journals Mapping Mineral Prospectivity Using a Hybrid Genetic Algorithm–Support Vector Machine (GA–SVM) Model

2021 ◽  
Vol 10 (11) ◽  
pp. 766
Author(s):  
Xishihui Du ◽  
Kefa Zhou ◽  
Yao Cui ◽  
Jinlin Wang ◽  
Shuguang Zhou

Machine learning (ML) as a powerful data-driven method is widely used for mineral prospectivity mapping. This study employs a hybrid of the genetic algorithm (GA) and support vector machine (SVM) model to map prospective areas for Au deposits in Karamay, northwest China. In the proposed method, GA is used as an adaptive optimization search method to optimize the SVM parameters that result in the best fitness. After obtaining evidence layers from geological and geochemical data, GA–SVM models trained using different training datasets were applied to discriminate between prospective and non-prospective areas for Au deposits, and to produce prospectivity maps for mineral exploration. The F1 score and spatial efficiency of classification were calculated to objectively evaluate the performance of each prospectivity model. The best model predicted 95.83% of the known Au deposits within prospective areas, occupying 35.68% of the study area. The results demonstrate the effectiveness of the GA–SVM model as a tool for mapping mineral prospectivity.

Molecules ◽  
2020 ◽  
Vol 25 (6) ◽  
pp. 1442 ◽  
Author(s):  
Tao Shen ◽  
Hong Yu ◽  
Yuan-Zhong Wang

Gentiana, which is one of the largest genera of Gentianoideae, most of which had potential pharmaceutical value, and applied to local traditional medical treatment. Because of the phytochemical diversity and difference of bioactive compounds among species, which makes it crucial to accurately identify authentic Gentiana species. In this paper, the feasibility of using the infrared spectroscopy technique combined with chemometrics analysis to identify Gentiana and its related species was studied. A total of 180 batches of raw spectral fingerprints were obtained from 18 species of Gentiana and Tripterospermum by near-infrared (NIR: 10,000–4000 cm−1) and Fourier transform mid-infrared (MIR: 4000–600 cm−1) spectrum. Firstly, principal component analysis (PCA) was utilized to explore the natural grouping of the 180 samples. Secondly, random forests (RF), support vector machine (SVM), and K-nearest neighbors (KNN) models were built while using full spectra (including 1487 NIR variables and 1214 FT-MIR variables, respectively). The MIR-SVM model had a higher classification accuracy rate than the other models that were based on the results of the calibration sets and prediction sets. The five feature selection strategies, VIP (variable importance in the projection), Boruta, GARF (genetic algorithm combined with random forest), GASVM (genetic algorithm combined with support vector machine), and Venn diagram calculation, were used to reduce the dimensions of the data variable in order to further reduce numbers of variables for modeling. Finally, 101 NIR and 73 FT-MIR bands were selected as the feature variables, respectively. Thirdly, stacking models were built based on the optimal spectral dataset. Most of the stacking models performed better than the full spectra-based models. RF and SVM (as base learners), combined with the SVM meta-classifier, was the optimal stacked generalization strategy. For the SG-Ven-MIR-SVM model, the accuracy (ACC) of the calibration set and validation set were both 100%. Sensitivity (SE), specificity (SP), efficiency (EFF), Matthews correlation coefficient (MCC), and Cohen’s kappa coefficient (K) were all 1, which showed that the model had the optimal authenticity identification performance. Those parameters indicated that stacked generalization combined with feature selection is probably an important technique for improving the classification model predictive accuracy and avoid overfitting. The study result can provide a valuable reference for the safety and effectiveness of the clinical application of medicinal Gentiana.


2014 ◽  
Vol 587-589 ◽  
pp. 2100-2104
Author(s):  
Qin Liu ◽  
Jian Min Xu ◽  
Kai Lu

Oversaturation in the modern urban traffic often happens. In order to describe the degree of oversaturation, the indexes of intersection oversaturation degree are put forward include dissipation time, stranded queue, overflow queue and travel speed. On the basis of selected indexes, the genetic algorithm support vector machine (GA-SVM) model was proposed to quantify the degree of oversaturation. In this method the genetic algorithm is used to select the model parameters. The GA-SVM model built is used to quantify the degree of oversaturation. Combining with the volume of intersections in Guangzhou city the method is calculated and simulated through programming. The simulation results show that GA-SVM method is effective and the accuracy of GA-SVM is higher than support vector machine (SVM).This method provides a theoretical basis for the analysis of traffic system under over-saturated traffic conditions.


2012 ◽  
Vol 446-449 ◽  
pp. 3037-3041 ◽  
Author(s):  
Fang Xiao

Forest fire disaster area prediction based on genetic algorithm and support vector machine is presented in the paper.Genetic algorithm is used to select appropriate parameters of support vector machine. Genetic algorithm can obtain the optimal solution by a series of iterative computations.The forest fire disaster area data in Jiangxi Province from 1970 to 1997 are used as our research data. The comparison of the forest fire disaster area forecasting results between the proposed GA-SVM model and the SVM model is given,which indicates that the proposed GA-SVM model has more excellent forest fire disaster area forecasting results than the SVM model.


Sensors ◽  
2018 ◽  
Vol 18 (10) ◽  
pp. 3222 ◽  
Author(s):  
Di Wang ◽  
Lin Xie ◽  
Simon Yang ◽  
Fengchun Tian

Near-infrared (NIR) spectral sensors deliver the spectral response of the light absorbed by materials for quantification, qualification or identification. Spectral analysis technology based on the NIR sensor has been a useful tool for complex information processing and high precision identification in the tobacco industry. In this paper, a novel method based on the support vector machine (SVM) is proposed to discriminate the tobacco cultivation region using the near-infrared (NIR) sensors, where the genetic algorithm (GA) is employed for input subset selection to identify the effective principal components (PCs) for the SVM model. With the same number of PCs as the inputs to the SVM model, a number of comparative experiments were conducted between the effective PCs selected by GA and the PCs orderly starting from the first one. The model performance was evaluated in terms of prediction accuracy and four parameters of assessment criteria (true positive rate, true negative rate, positive predictive value and F1 score). From the results, it is interesting to find that some PCs with less information may contribute more to the cultivation regions and are considered as more effective PCs, and the SVM model with the effective PCs selected by GA has a superior discrimination capacity. The proposed GA-SVM model can effectively learn the relationship between tobacco cultivation regions and tobacco NIR sensor data.


2014 ◽  
Vol 628 ◽  
pp. 383-389 ◽  
Author(s):  
Ya Hui Peng ◽  
Kang Peng ◽  
Jian Zhou ◽  
Zhi Xiang Liu

Due to the complex features of rock burst hazard assessment systems, a support vector machine (SVM) model for predicting of classification of rock burst was established based on the SVM theory and the actual characteristics of the project in this study. The main factors of rock burst, such as coal seam, dip, buried depth, structure situation, change of pitch angle, change of coal thickness, gas concentration, roof management, pressure relief and shooting were defined as the criterion indices for rock burst prediction in the proposed model. In order to determine reasonable and efficient the parameters of SVM, Firstly, the appropriate fitness function for genetic algorithms (GA) operation was determined, and then optimization parameters of SVM model were selected by real coded GA, therefore, the genetic algorithms and support vector machine (GSVM) model was established. A GSVM model was obtained through training 23 sets of measured data, the cross-validation method was introduced to verify the stability of GSVM model and the ratio of mis-discrimination is 0. Moreover, the proposed model was used to predict 12 new samples rock burst, the correct rate of prediction results is 91.6667% and are identical with actual situation. The results show that the genetic algorithm can speed up SVM parameter optimization search, the proposed model has a high credibility in the study of rock burst prediction of risk classification, which can be applied to practical engineering.


2011 ◽  
Vol 332-334 ◽  
pp. 1198-1201 ◽  
Author(s):  
Yun Hui Yang ◽  
Yi Ping Ji

Distinguishing of wool and cashmere is one of the toughest problems in fiber identification area. Support Vector Machine (SVM) was advanced here to classify fibers, and Genetic Algorithm (GA) was used to optimize multi-parameters of SVM simultaneously. Experimental results show that it plays full part of the GA, and accelerates the optimization search of SVM parameters. The model established is of practical significance in identification of wool and cashmere.


Sign in / Sign up

Export Citation Format

Share Document