Detection of Loss Zones while Drilling Using Different Machine Learning Techniques

2021 ◽  
pp. 1-29
Author(s):  
Ahmed Alsaihati ◽  
Mahmoud Abughaban ◽  
Salaheldin Elkatatny ◽  
Abdulazeez Abdulraheem

Abstract Fluid loss into formations is a common operational issue that is frequently encountered when drilling across naturally or induced fractured formations. This could pose significant operational risks, such as well-control, stuck pipe, and wellbore instability, which, in turn, lead to an increase of well time and cost. This research aims to use and evaluate different machine learning techniques, namely: support vector machines, random forests, and K-nearest neighbors in detecting loss circulation occurrences while drilling using solely drilling surface parameters. Actual field data of seven wells, which had suffered partial or severe loss circulation, were used to build predictive models, while Well-8 was used to compare the performance of the developed models. Different performance metrics were used to evaluate the performance of the developed models. Recall, precision, and F1-score measures were used to evaluate the ability of the developed model to detect loss circulation occurrences. The results showed the K-nearest neighbors classifier achieved a high F1-score of 0.912 in detecting loss circulation occurrence in the testing set, while the random forests was the second-best classifier with almost the same F1-score of 0.910. The support vector machines achieved an F1-score of 0.83 in predicting the loss circulation occurrence in the testing set. The K-nearest neighbors outperformed other models in detecting the loss circulation occurrences in Well-8 with an F1-score of 0.80. The main contribution of this research as compared to previous studies is that it identifies losses events based on real-time measurements of the active pit volume.

2012 ◽  
Vol 2 (5) ◽  
pp. 269-272 ◽  
Author(s):  
K. Theofilatos ◽  
S. Likothanassis ◽  
A. Karathanasopoulos

The present paper aims in investigating the performance of state-of-the-art machine learning techniques in trading with the EUR/USD exchange rate at the ECB fixing. For this purpose, five supervised learning classification techniques (K-Nearest Neighbors algorithm, Naïve Bayesian Classifier, Artificial Neural Networks, Support Vector Machines and Random Forests) were applied in the problem of the one day ahead movement prediction of the EUR/USD exchange rate with only autoregressive terms as inputs. For comparison reasons, the performance of all machine learning techniques was benchmarked by two traditional techniques (Naïve  Strategy and moving average convergence/divergence model). Trading strategies produced by the machine learning techniques of Support Vector Machines and Random Forests clearly outperformed all other strategies in terms of annualized return and sharp ratio. To the best of our knowledge, this is the first application of Random Forests in the problem of trading with the EUR/USD exchange rate providing extremely satisfactory results.


2020 ◽  
Vol 24 (5) ◽  
pp. 1141-1160
Author(s):  
Tomás Alegre Sepúlveda ◽  
Brian Keith Norambuena

In this paper, we apply sentiment analysis methods in the context of the first round of the 2017 Chilean elections. The purpose of this work is to estimate the voting intention associated with each candidate in order to contrast this with the results from classical methods (e.g., polls and surveys). The data are collected from Twitter, because of its high usage in Chile and in the sentiment analysis literature. We obtained tweets associated with the three main candidates: Sebastián Piñera (SP), Alejandro Guillier (AG) and Beatriz Sánchez (BS). For each candidate, we estimated the voting intention and compared it to the traditional methods. To do this, we first acquired the data and labeled the tweets as positive or negative. Afterward, we built a model using machine learning techniques. The classification model had an accuracy of 76.45% using support vector machines, which yielded the best model for our case. Finally, we use a formula to estimate the voting intention from the number of positive and negative tweets for each candidate. For the last period, we obtained a voting intention of 35.84% for SP, compared to a range of 34–44% according to traditional polls and 36% in the actual elections. For AG we obtained an estimate of 37%, compared with a range of 15.40% to 30.00% for traditional polls and 20.27% in the elections. For BS we obtained an estimate of 27.77%, compared with the range of 8.50% to 11.00% given by traditional polls and an actual result of 22.70% in the elections. These results are promising, in some cases providing an estimate closer to reality than traditional polls. Some differences can be explained due to the fact that some candidates have been omitted, even though they held a significant number of votes.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Tom Elliot ◽  
Robert Morse ◽  
Duane Smythe ◽  
Ashley Norris

AbstractIt is 50 years since Sieveking et al. published their pioneering research in Nature on the geochemical analysis of artefacts from Neolithic flint mines in southern Britain. In the decades since, geochemical techniques to source stone artefacts have flourished globally, with a renaissance in recent years from new instrumentation, data analysis, and machine learning techniques. Despite the interest over these latter approaches, there has been variation in the quality with which these methods have been applied. Using the case study of flint artefacts and geological samples from England, we present a robust and objective evaluation of three popular techniques, Random Forest, K-Nearest-Neighbour, and Support Vector Machines, and present a pipeline for their appropriate use. When evaluated correctly, the results establish high model classification performance, with Random Forest leading with an average accuracy of 85% (measured through F1 Scores), and with Support Vector Machines following closely. The methodology developed in this paper demonstrates the potential to significantly improve on previous approaches, particularly in removing bias, and providing greater means of evaluation than previously utilised.


2019 ◽  
Vol 8 (3) ◽  
pp. 4500-4502

we develop a organized correlation of machine learning techniques connected to the issue of completely programmed acknowledgment of facial emotions. We investigate consequences on a progress of researches looking at acknowledgment engines, combining AdaBoost, support vector machines, linear discriminate analysis. We likewise investigated highlight choice strategies, including the utilization of AdaBoost for highlight choice before order through SVM or else LDA. Best outcomes are gotten through prefering a subset of Gabor conduit develop AdaBoost pursued through order with Support Vector Machines. The framework works continuously, within addition to got 93% right speculation novel matters intended for a 7-way compelled alternative going the Cohn-Kanade articulation information. The yields of the classier alteration easily an element of time and in this way can be utilized to gauge outward appearance elements. We connected the framework to fully automated recognition of facial activities (FACS). The current framework arranges 17 activity units, regardless of even those coming as one or else within combine with different activities, with a mean precision of 94.8%. The design fundamental consequences intended for applying this framework to facial emotions.


2019 ◽  
Vol 8 (4) ◽  
pp. 5813-5816

Now a days there is lots of data floating in the life of world access i.e Internet which is unstructured data.To manage this unstructured data we are introduced some classification algorithms in machine learning to classify the data.Sentiment Analysis[5] is contextual mining of text from documents ,reviews of customers which distinguishes and concentrates emotional data in source material. Assessment API works in fourteen unique dialects .We consider the issue of grouping records not by subject, however by generally speaking slant, e.g., deciding if an audit is certain or negative. Utilizing antiperspirants surveys as information, we locate that standard AI systems absolutely beat human-delivered baselines. The AI stratagies we connected with for arrangement are Naive Bayes, maximum entropy[2] classification, and support vector machines classification algorithms for sentiment classification as on traditional topic-based categorization.[1].


Author(s):  
Nick Guenther ◽  
Matthias Schonlau

Support vector machines are statistical- and machine-learning techniques with the primary goal of prediction. They can be applied to continuous, binary, and categorical outcomes analogous to Gaussian, logistic, and multinomial regression. We introduce a new command for this purpose, svmachines. This package is a thin wrapper for the widely deployed libsvm (Chang and Lin, 2011, ACM Transactions on Intelligent Systems and Technology 2(3): Article 27). We illustrate svmachines with two examples.


2019 ◽  
Vol 4 (2) ◽  
pp. 12-16 ◽  
Author(s):  
Allan Alves Pinheiro ◽  
Iago Modesto Brandao ◽  
Cesar Da Costa

This study proposes a method for diagnosing faults in turbomachines using machine learning techniques. In this study, a support vector machine-SVM algorithm is proposed for fault diagnosis of rotor rotation imbalance. Recently, support vector machines (SVMs) have become one of the most popular classification methods in vibration analysis technology. Axis unbalance defect is classified using support vector machines. The experimental data is derived from the turbomachine model of the rigid-shaft rotor and the flexible bearings, and the experimental setup for vibration analysis. Several situations of unbalance defects have been successfully detected.


2020 ◽  
Vol 7 ◽  
Author(s):  
Holmes Yesid Ayala-Yaguara ◽  
Gina Maribel Valenzuela-Sabogal ◽  
Alexander Espinosa-García

En el presente artículo se describe la obtención de un modelo de minería de datos aplicado al problema de la deserción universitaria en el programa de Ingeniería de Sistemas de la Universidad de Cundinamarca, extensión Facatativá. El modelo se estructuró mediante la metodología de minería de datos KDD (knowledge discovery in databases) haciendo uso del lenguaje de programación Python, la librería de procesamiento de datos Pandas y de machine learning Sklearn. Para el proceso se tuvieron en cuenta problemas adicionales al proceso de minería, como, por ejemplo, la alta dimensionalidad, por lo cual se aplicaron los métodos de selección de las variables estadístico univariado, feature importance y SelectFromModel (Sklearn). En el proyecto se seleccionaron cinco técnicas de minería de datos para evaluarlas: vecinos más cercanos (K nearest neighbors, KNN), árboles de decisión (decision tree, DT), árboles aleatorios (random forest, RF), regresión logística (logistic regression, LR) y máquinas de vectores soporte (support vector machines, SVM). Respecto a la selección del modelo final se evaluaron los resultados de cada modelo en las métricas de precisión, matriz de confusión y métricas adicionales de la matriz de confusión. Por último, se ajustaron los parámetros del modelo seleccionado y se evaluó la generalización del modelo al graficar su curva de aprendizaje.


Sign in / Sign up

Export Citation Format

Share Document