scholarly journals Oil Family Typing Using a Hybrid Model of Self-Organizing Map and Artificial Neural Network

2021 ◽  
Author(s):  
Amir Mosavi ◽  
Majid

Identifying the number of oil families in petroleum basins provides practical and valuable information in petroleum geochemistry studies from exploration to development. Oil family grouping helps us track migration pathways, identify the number of active source rock(s), and examine the reservoir continuity. To date, almost in all oil family typing studies, common statistical methods such as principal component analysis (PCA) and hierarchical clustering analysis (HCA) have been used. However, there is no publication regarding using artificial neural networks (ANNs) for examining the oil families in petroleum basins. Hence, oil family typing requires novel, not overused and common techniques. This paper is the first report of oil family typing using ANNs as robust computational methods. To this end, a self-organization map (SOM) neural network associated with three clustering validity indices were employed on oil samples belonging to the Iranian part of the Persian Gulf’ oilfields. For the SOM network, at first, ten default clusters were selected. Afterwards, three effective clustering validity coefficients, namely Calinski-Harabasz (CH), Silhouette indexes (SI) and Davies-Bouldin (DB), were operated to find the optimum number of clusters. Accordingly, among ten default clusters, the maximum CH (62) and SI (0.58) were acquired for four clusters. Likewise, the lowest DB (0.8) was obtained for four clusters. Thus, all three validation coefficients introduced four clusters as the optimum number of clusters or oil families. The number of oil families identified in the present report is consistent with those previously reported by other researchers in the same study area. However, the techniques used in the present paper, which have not been implemented so far, can be introduced as more straightforward for clustering purposes in the oil family typing than those of common and overused methods of PCA and HCA.

2021 ◽  
Author(s):  
Majid ◽  
Amir Mosavi

Identifying the number of oil families in petroleum basins provides practical and valuable information in petroleum geochemistry studies from exploration to development. Oil family grouping helps us track migration pathways, identify the number of active source rock(s), and examine the reservoir continuity. To date, almost in all oil family typing studies, common statistical methods such as principal component analysis (PCA) and hierarchical clustering analysis (HCA) have been used. However, there is no publication regarding using artificial neural networks (ANNs) for examining the oil families in petroleum basins. Hence, oil family typing requires novel, not overused and common techniques. This paper is the first report of oil family typing using ANNs as robust computational methods. To this end, a self-organization map (SOM) neural network associated with three clustering validity indices were employed on oil samples belonging to the Iranian part of the Persian Gulf’ oilfields. For the SOM network, at first, ten default clusters were selected. Afterwards, three effective clustering validity coefficients, namely Calinski-Harabasz (CH), Silhouette indexes (SI) and Davies-Bouldin (DB), were operated to find the optimum number of clusters. Accordingly, among ten default clusters, the maximum CH (62) and SI (0.58) were acquired for four clusters. Likewise, the lowest DB (0.8) was obtained for four clusters. Thus, all three validation coefficients introduced four clusters as the optimum number of clusters or oil families. The number of oil families identified in the present report is consistent with those previously reported by other researchers in the same study area. However, the techniques used in the present paper, which have not been implemented so far, can be introduced as more straightforward for clustering purposes in the oil family typing than those of common and overused methods of PCA and HCA.


2020 ◽  
Vol 65 (6) ◽  
pp. 747-757
Author(s):  
Seyed Ayoob Noorbakhsh ◽  
Mahmood Mahmoodi-Eshkaftaki ◽  
Zahra Mokhtari

AbstractThe aim of this study was to develop and compare techniques to increase the prediction accuracy of patient mortality and organ dysfunction in the Intensive Care Units (hereinafter ICU) of hospitals. Patient mortality was estimated with two models of artificial neural network (ANN)-backpropagation (BP) and simplified acute physiology score (SAPS). Organ dysfunction was predicted by coupled ANN self-organizing map (SOM) and logistic organ dysfunction score (LODS) method on the basis of patient conditions. Input dataset consisted of 36 features recorded for 4,000 patients in the ICU. An integrated response surface methodology (RSM) and genetic algorithm (GA) was developed to achieve the best topology of the ANN-BP model. Although mortality prediction of the best ANN-BP (MSE = 0.0036, AUC = 0.83, R2 = 0.81) was more accurate than that of the SAPS score model (MSE = 0.0056, AUC = 0.82, R2 = 0.78), the execution time of the former (=45 min) was longer than that of the latter (=20 min). Therefore, the principal component analysis (PCA) was used to reduce the input feature dimensions, which, in turn, reduced the execution time up to 50%. Data reduction also helped to increase the network accuracy up to 90%. The likelihood of organ dysfunction determined by coupled ANN and scoring method technique can be much more efficient than the LODS model alone because the SOM could successfully classify the patients in 64 classes. The primary patient classification plays a major role in increasing the efficiency of an estimator.


2017 ◽  
Vol 14 (9) ◽  
pp. 095601 ◽  
Author(s):  
Huimin Sun ◽  
Yaoyong Meng ◽  
Pingli Zhang ◽  
Yajing Li ◽  
Nan Li ◽  
...  

2011 ◽  
Vol 467-469 ◽  
pp. 894-899
Author(s):  
Hong Men ◽  
Hai Yan Liu ◽  
Lei Wang ◽  
Yun Peng Pan

This paper presents an optimizing method of competitive neural network(CNN):During clustering analysis fixed on the optimum number of output neurons according to the change of DB value,and then adjusted connected weight including increasing ,dividing , delete. Each neuron had the different variety trend of learning rate according with the change of the probability of neurons. The optimizing method made classification more accurate. Simulation results showed that optimized network structure had a strong ability to adjust the number of clusters dynamically and good results of classification.


2014 ◽  
Vol 17 (1) ◽  
pp. 56-74 ◽  
Author(s):  
Gurjeet Singh ◽  
Rabindra K. Panda ◽  
Marc Lamers

The reported study was undertaken in a small agricultural watershed, namely, Kapgari in Eastern India having a drainage area of 973 ha. The watershed was subdivided into three sub-watersheds on the basis of drainage network and land topography. An attempt was made to relate the continuously monitored runoff data from the sub-watersheds and the whole-watershed with the rainfall and temperature data using the artificial neural network (ANN) technique. The reported study also evaluated the bias in the prediction of daily runoff with shorter length of training data set using different resampling techniques with the ANN modeling. A 10-fold cross-validation (CV) technique was used to find the optimum number of hidden neurons in the hidden layer and to avoid neural network over-fitting during the training process for shorter length of data. The results illustrated that the ANN models developed with shorter length of training data set avoid neural network over-fitting during the training process, using a 10-fold CV method. Moreover, the biasness was investigated using the bootstrap resampling technique based ANN (BANN) for short length of training data set. In comparison with the 10-fold CV technique, the BANN is more efficient in solving the problems of the over-fitting and under-fitting during training of models for shorter length of data set.


Sign in / Sign up

Export Citation Format

Share Document