Application of Machine Learning for Lithology-on-Bit Prediction using Drilling Data in Real-Time

2021 ◽  
Author(s):  
Temirlan Zhekenov ◽  
Artem Nechaev ◽  
Kamilla Chettykbayeva ◽  
Alexey Zinovyev ◽  
German Sardarov ◽  
...  

SUMMARY Researchers base their analysis on basic drilling parameters obtained during mud logging and demonstrate impressive results. However, because of the data-quality limitations often encountered during drilling, those solutions tend to lose their stability and predictive power. This work introduces the concept of hybrid modeling, which integrates analytical correlations with machine learning algorithms to obtain stable solutions that are consistent from one data set to another.

2021 ◽  
Author(s):  
S. H. Al Gharbi ◽  
A. A. Al-Majed ◽  
A. Abdulraheem ◽  
S. Patil ◽  
S. M. Elkatatny

Abstract Due to high demand for energy, oil and gas companies started to drill wells in remote areas and unconventional environments. This raised the complexity of drilling operations, which were already challenging and complex. To adapt, drilling companies expanded their use of the real-time operation center (RTOC) concept, in which real-time drilling data are transmitted from remote sites to companies’ headquarters. In RTOC, groups of subject matter experts monitor the drilling live and provide real-time advice to improve operations. With the increase of drilling operations, processing the volume of generated data is beyond a human's capability, limiting the RTOC impact on certain components of drilling operations. To overcome this limitation, artificial intelligence and machine learning (AI/ML) technologies were introduced to monitor and analyze the real-time drilling data, discover hidden patterns, and provide fast decision-support responses. AI/ML technologies are data-driven technologies, and their quality relies on the quality of the input data: if the quality of the input data is good, the generated output will be good; if not, the generated output will be bad. Unfortunately, due to the harsh environments of drilling sites and the transmission setups, not all of the drilling data is good, which negatively affects the AI/ML results. The objective of this paper is to utilize AI/ML technologies to improve the quality of real-time drilling data. The paper fed a large real-time drilling dataset, consisting of over 150,000 raw data points, into Artificial Neural Network (ANN), Support Vector Machine (SVM) and Decision Tree (DT) models. The models were trained on the valid and not-valid datapoints. The confusion matrix was used to evaluate the different AI/ML models including different internal architectures. Despite the slowness of ANN, it achieved the best result with an accuracy of 78%, compared to 73% and 41% for DT and SVM, respectively. 
The paper concludes by presenting a process for using AI technology to improve real-time drilling data quality. To the author's knowledge based on literature in the public domain, this paper is one of the first to compare the use of multiple AI/ML techniques for quality improvement of real-time drilling data. The paper provides a guide for improving the quality of real-time drilling data.
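
The abstract above says the models were evaluated with a confusion matrix and compared by accuracy. As a minimal sketch of that evaluation step (not the authors' code), the following builds a confusion matrix for valid / not-valid labels and derives accuracy from its diagonal. The label lists are hypothetical illustration data, and the function names are assumptions introduced here.

```python
def confusion_matrix(y_true, y_pred, labels):
    """Count (actual, predicted) pairs; rows are actual labels, columns are predicted."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for actual, predicted in zip(y_true, y_pred):
        matrix[index[actual]][index[predicted]] += 1
    return matrix


def accuracy(matrix):
    """Fraction of points on the diagonal, i.e. classified correctly."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total if total else 0.0


# Hypothetical data-quality labels for eight drilling data points.
y_true = ["valid", "valid", "not-valid", "valid",
          "not-valid", "not-valid", "valid", "valid"]
y_pred = ["valid", "not-valid", "not-valid", "valid",
          "valid", "not-valid", "valid", "valid"]

cm = confusion_matrix(y_true, y_pred, ["valid", "not-valid"])
acc = accuracy(cm)
print(cm, acc)
```

Accuracy alone can hide class imbalance, so the off-diagonal counts of the matrix are worth inspecting when valid points far outnumber not-valid ones.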


2001 ◽  
Vol 4 (06) ◽  
pp. 489-501 ◽  
Author(s):  
D. Kandel ◽  
R. Quagliaroli ◽  
G. Segalini ◽  
B. Barraud

Summary The acquisition of gas in mud data while drilling for geological surveillance and safety is an almost universal practice. This source of data is only rarely used for formation evaluation because of the widely accepted presumption that it is unreliable and unrepresentative. Recent developments in the mud-logging industry to improve gas data acquisition and analysis have led to the availability of better quality data. Within a joint Elf/Eni-Agip Div. research program, a new interpretation method has been developed following the comprehensive analysis and interpretation of gas data from a wide range of wells covering different types of geological, petroleum, and drilling environments. The results, validated by correlation and comparison with other data such as logs, well tests, and pressure/volume/temperature (PVT) data, enable us to characterize lithological changes; porosity variations and permeability barriers; seal depth, thickness, and efficiency; gas diffusion or leakage; gas/oil and hydrocarbon/water contacts; vertical changes in fluid over a thick monolayer pay zone; vertical fluid differentiation in multilayer intervals; and biodegradation. The comparison of surface gas, PVT, and geochemistry data clearly confirms the consistency between the drilling gas data (gas shows) and the corresponding reservoir fluid composition. The near real-time availability, at no extra acquisition cost, of such data has led to:
- The optimization of future well operations (such as logging and testing).
- A better integration of while-drilling data to the well evaluation process.
- A significant improvement in both early formation evaluation and reservoir studies, especially for the following applications, in which traditional log analysis often remains inconclusive:
  - Very-low-porosity reservoirs.
  - Thin beds.
  - Dynamic barriers and seal efficiency.
  - Low-resistivity pay.
  - Light hydrocarbons.
Examples show gas while drilling (GWD) wellsite quicklook interpretations with simple lithological and fluid interpretations, as well as more complex reservoir and fluid characterization applications in varied geographical and geological contexts; both demonstrate how GWD data are integrated with more standard data sets.

Introduction The measurement of gas shows is standard practice during the drilling of exploration and development wells. Continuous gas monitoring sometimes enables us to indicate, in general terms, the presence of hydrocarbon-bearing intervals, but it rarely allows us to define the fluid types (oil, condensate and/or gas, and water). Gas data are at present largely underused because they are considered unreliable and not fully representative of the formation fluids. There are many reasons for this. On one hand, poorly established correlations exist between reservoir fluids and shows at surface; on the other hand, numerous drilling parameters strongly influence the recorded gas data, such as formation pressure, mud weight and type, gas-trap position in the shaker ditch, and mud-out temperatures. One reason may be the very low cost of such data, often equated with low value. Until a few years ago, the analysis performed on gas shows was generally restricted to the use of Pixler and/or Geoservices diagrams (or equivalent), wetness, balance, character, and gas normalization.1–4 Recent improvements in gas-acquisition technology and the new GWD methodology allow us to perform reservoir interpretation in near real time for fluid identification and contacts [oil/water contact (OWC), gas/oil contact (GOC), etc.], lithological changes, and barrier efficiency, thus allowing operations optimization (e.g., coring, wireline recording and sampling, and testing operations). It is also possible to integrate the GWD interpretation in reservoir, geochemical, PVT analysis, and comprehensive studies.

Method Data Acquisition.
The measurement of gas shows in the circulating drilling mud was introduced in the early days of mud logging (ML) with two objectives: first, as a safety device to indicate well behavior to drillers, and second, as an indicator of hydrocarbon-bearing zones. Today, gas-shows measurement is systematically acquired in the petroleum industry for the same reason, but it is seldom used to its full potential, mainly because of an ongoing prejudice that the data are not representative of the formation fluids and/or that the recording of these data is strongly influenced by varying drilling parameters. The ML gas system is composed of three parts:
- A "gas trap" to extract gas from the mud stream situated somewhere between the bell nipple and the shaker box (often in the latter).
- Lines, pumps, and filters enabling the transport of a dry-gas sample to the ML unit.
- A detection system in the ML unit.

Recent efforts in the mud-logging industry to improve gas-data acquisition and analysis have led to the availability of better quality data, which has provided reliable lithological and fluid information since the 1990s. In the 1980s, most of the ML companies introduced the flame ionization detectors (FID) to replace previous total gas (TG) and chromatograph measurements. The TG measurement gives the total amount of hydrocarbon components extracted from the mud and burned in the detector. The TG could now be correlated with the C1-C5 readings from the new breed of chromatographs.5 Finally, over the past few years, several ML companies have introduced fast-gas chromatographs with improved resolution (C1-C5 in less than 1 minute), improved C1/C2 separation, and, above all, improved reliability and repeatability. High-speed chromatographs using a thermal-conductivity detector have also appeared on the market, but they were not tested within this project.
Work carried out by Texaco in the early 1990s led to a significant improvement in basic trap design with the introduction of the quantitative gas measurement (QGM) trap, which was a major step in reducing the effect of environmental changes.6 An alternative proposition from Geoservices was to replace the trap, generally situated in the shaker box, with a pumping system supplying the trap with a constant volume of mud sucked from a probe situated in the flowline close to the bell nipple.7


2021 ◽  
pp. 1-13
Author(s):  
Hany Gamal ◽  
Ahmed Alsaihati ◽  
Salaheldin Elkatatny

Abstract The sonic data provides significant rock properties that are commonly used for designing the operational programs for drilling, rock fracturing, and development operations. The conventional methods for acquiring the rock sonic data in terms of compressional and shear slowness (ΔTc and ΔTs) are considered costly and time-consuming operations. The target of this paper is to propose machine learning models for predicting the sonic logs from the drilling data in real-time. Decision tree (DT) and random forest (RF) were employed as train-based algorithms for building the sonic prediction models for drilling complex lithology rocks that have limestone, sandstone, shale, and carbonate formations. The input data for the models include the surface drilling parameters to predict the shear and compressional slowness. The study employed a data set of 2888 data points for building and testing the models, while another collected data set of 2863 points was utilized for further validation of the sonic models. Sensitivity investigations were performed for the DT and RF models to confirm optimal accuracy. The correlation coefficient (R) and average absolute percentage error (AAPE) were used to check the models' accuracy between the actual values and the models' outputs, in addition to the sonic log profiles. The results indicated that the developed sonic models have a high capability for sonic prediction from the drilling data, as the DT model recorded R higher than 0.967 and AAPE less than 2.76% for the ΔTc and ΔTs models, while RF showed R higher than 0.991 with AAPE less than 1.07%. The further validation process for the developed models also indicated strong results for the sonic prediction, and the RF model outperformed the DT models: RF showed R higher than 0.986 with AAPE less than 1.12%, while DT prediction recorded R greater than 0.93 with AAPE less than 1.95%.
The sonic prediction through the developed models will save the cost and time for acquiring the sonic data through the conventional methods and will provide real-time estimation from the drilling parameters.
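
Several abstracts in this list report model accuracy as a correlation coefficient (R) and an average absolute percentage error (AAPE). A minimal sketch of both metrics follows, assuming R is the Pearson correlation between actual and predicted values and AAPE is the mean of |actual - predicted| / |actual| expressed in percent. The sample slowness values are hypothetical, not data from any of the papers.

```python
import math


def correlation_coefficient(actual, predicted):
    """Pearson correlation coefficient R between two equal-length sequences."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_a * var_p)


def aape(actual, predicted):
    """Average absolute percentage error, in percent (actual values must be nonzero)."""
    errors = [abs((a - p) / a) for a, p in zip(actual, predicted)]
    return 100.0 * sum(errors) / len(errors)


# Hypothetical measured vs. predicted slowness values.
actual = [50.0, 60.0, 80.0, 100.0]
predicted = [51.0, 59.4, 80.0, 98.0]

r = correlation_coefficient(actual, predicted)
error_pct = aape(actual, predicted)
print(round(r, 4), round(error_pct, 2))
```

R measures how well the predicted profile tracks the trend of the actual log, while AAPE measures the size of the point-wise errors, which is why the two are usually reported together.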


2022 ◽  
pp. 1-14
Author(s):  
Salem Al-Gharbi ◽  
Abdulaziz Al-Majed ◽  
Salaheldin Elkatatny ◽  
Abdulazeez Abdulraheem

Abstract Due to high demand for energy, oil and gas companies started to drill wells in remote environments, conducting unconventional operations. In order to maintain safe, fast and more cost-effective operations, utilizing machine learning (ML) technologies has become a must. The harsh environments of drilling sites and the transmission setups negatively affect the drilling data, leading to less than acceptable ML results. For that reason, a large portion of the effort in ML development projects was actually spent on improving the data by data-quality experts. The objective of this paper is to evaluate the effectiveness of ML in improving real-time drilling-data quality and to compare it with human expert knowledge. To achieve that, two large real-time drilling datasets were used: one dataset was used to train three different ML techniques, artificial neural network (ANN), support vector machine (SVM) and decision tree (DT), and the second dataset was used to evaluate them. The ML results were compared with the results of a real-time drilling data quality expert. Although ANN is complex and generally gives good results, it achieved a relative root mean square error (RRMSE) of 2.83%, a weaker result than DT and SVM, which achieved RRMSE values of 0.35% and 0.48%, respectively. The uniqueness of this work is in developing ML that simulates the improvement of drilling-data quality by an expert. This research provides a guide for improving the quality of real-time drilling data.


2021 ◽  
Author(s):  
Ahmed Al-Sabaa ◽  
Hany Gamal ◽  
Salaheldin Elkatatny

Abstract The formation porosity of drilled rock is an important parameter that determines the formation storage capacity. The common industrial technique for rock porosity acquisition is through the downhole logging tool. Logging while drilling or wireline porosity logging usually provides a complete porosity log for the section of interest; however, operational constraints on the logging tool, in addition to the job cost, might preclude the logging job. The objective of this study is to provide an intelligent prediction model to predict the porosity from the drilling parameters. Artificial neural network (ANN), a tool of artificial intelligence (AI), was employed in this study to build the porosity prediction model based on drilling parameters such as the weight on bit (WOB), drill string rotating-speed (RS), drilling torque (T), stand-pipe pressure (SPP), and mud pumping rate (Q). The novel contribution of this study is to provide a rock porosity model for complex lithology formations using drilling parameters in real-time. The model was built using 2,700 data points from well (A) with a 74:26 training to testing ratio. Many sensitivity analyses were performed to optimize the ANN model. The model was validated using an unseen data set (1,000 data points) of Well (B), which is located in the same field and drilled across the same complex lithology. The results showed high performance for the model in the training, testing, and validation processes. The overall accuracy for the model was determined in terms of correlation coefficient (R) and average absolute percentage error (AAPE). Overall, R was higher than 0.91 and AAPE was less than 6.1 % for the model building and validation. Predicting the rock porosity while drilling in real-time will save the logging cost and will also provide a guide for the formation storage capacity and interpretation analysis.


2021 ◽  
Author(s):  
Aurore Lafond ◽  
Maurice Ringer ◽  
Florian Le Blay ◽  
Jiaxu Liu ◽  
Ekaterina Millan ◽  
...  

Abstract Abnormal surface pressure is typically the first indicator of a number of problematic events, including kicks, losses, washouts and stuck pipe. These events account for 60–70% of all drilling-related nonproductive time, so their early and accurate detection has the potential to save the industry billions of dollars. Detecting these events today requires an expert user watching multiple curves, which can be costly and subject to human error. The solution presented in this paper aims to augment traditional models with new machine learning techniques, which make it possible to detect these events automatically and help monitor the well being drilled. Today’s real-time monitoring systems employ complex physical models to estimate surface standpipe pressure while drilling. These require many inputs and are difficult to calibrate. Machine learning is an alternative method to predict pump pressure, but this alone needs significant labelled training data, which is often lacking in the drilling world. The new system combines these approaches: a machine learning framework is used to enable automated learning while the physical models work to compensate for any gaps in the training data. The system uses only standard surface measurements, is fully automated, and is continuously retrained while drilling to ensure the most accurate pressure prediction. In addition, a stochastic (Bayesian) machine learning technique is used, which provides not only a prediction of the pressure, but also the uncertainty and confidence of this prediction. Last, the new system includes a data quality control workflow. It discards periods of low data quality from the pressure anomaly detection and enables a smarter real-time events analysis. The new system has been tested on historical wells using a new test and validation framework.
The framework runs the system automatically on large volumes of both historical and simulated data, to enable cross-referencing the results with observations. In this paper, we show the results of the automated test framework as well as the capabilities of the new system in two specific case studies, one on land and another offshore. Moreover, large-scale statistics illustrate the reliability and efficiency of this new detection workflow. The new system builds on the trend in our industry to better capture and utilize digital data for optimizing drilling.
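
The abstract above describes a Bayesian technique that returns both a pressure prediction and its uncertainty, which is then used for anomaly detection. The sketch below illustrates that general idea only and is not the authors' system: it assumes a deliberately simple one-parameter model in which standpipe pressure equals k times flow rate squared plus Gaussian noise of known standard deviation, with a Gaussian prior on k. All names, units, and numbers are hypothetical.

```python
import math


def posterior_k(flows, pressures, prior_mean, prior_std, noise_std):
    """Conjugate Gaussian update for k in: pressure = k * flow**2 + noise."""
    prior_precision = 1.0 / prior_std ** 2
    noise_precision = 1.0 / noise_std ** 2
    xs = [q ** 2 for q in flows]
    precision = prior_precision + noise_precision * sum(x * x for x in xs)
    weighted = noise_precision * sum(x * y for x, y in zip(xs, pressures))
    mean = (prior_precision * prior_mean + weighted) / precision
    return mean, 1.0 / precision  # posterior mean and variance of k


def predict(flow, k_mean, k_var, noise_std):
    """Predictive mean and standard deviation of pressure at a given flow rate."""
    x = flow ** 2
    return k_mean * x, math.sqrt(x * x * k_var + noise_std ** 2)


def is_anomaly(measured, mean, std, z=3.0):
    """Flag a measurement more than z predictive standard deviations from the mean."""
    return abs(measured - mean) > z * std


# Hypothetical training points, roughly following pressure = 2 * flow**2.
flows = [1.0, 2.0, 3.0]
pressures = [2.1, 7.9, 18.2]
NOISE_STD = 0.5

k_mean, k_var = posterior_k(flows, pressures, prior_mean=0.0,
                            prior_std=10.0, noise_std=NOISE_STD)
mean, std = predict(2.5, k_mean, k_var, NOISE_STD)
print(is_anomaly(12.8, mean, std), is_anomaly(16.0, mean, std))
```

Because the predictive standard deviation includes the remaining uncertainty in k, the anomaly threshold is wider when little training data is available and tightens as more points are observed.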


2021 ◽  
Author(s):  
Kriti Singh ◽  
Sai Yalamarty ◽  
Curtis Cheatham ◽  
Khoa Tran ◽  
Greg McDonald

Abstract This paper is a follow-up to the URTeC (2019-343) publication, in which the training of a Machine Learning (ML) model to predict rate of penetration (ROP) is described. The ML model gathers recent drilling parameters and approximates drilling conditions downhole to predict ROP. In real time, the model is run through an optimization sweep by adjusting parameters which can be controlled by the driller. The optimal drilling parameters and modeled ROP are then displayed for the driller to utilize. The ML model was successfully deployed and tested in real time in collaboration with leading shale operators in the Permian Basin. The testing phase was split in two parts: preliminary field tests and trials of the end-product. The key learnings from preliminary field tests were used to develop an integrated driller's dashboard with optimal drilling parameter recommendations and situational awareness tools for high dysfunction and procedural compliance, which was used for the designed trials. The results of field trials are discussed, where subject well ROP was improved by 19% to 33% when compared against observation/control footage. The overall ROP on subject wells was also compared against offset wells with similar target formations, BHAs, and wellbore trajectories. In those comparisons against qualified offsets, ROP was improved by as little as 5% and as much as 33%. In addition to comparing ROP performance, results from post-run data analysis are also presented. Detailed drilling data analytics were performed to check if using the recommendations during the trial caused any detrimental effects such as divergence in directional trends or high lateral or axial vibrations. The results from this analysis indicate that the measured downhole axial and lateral vibrations were in the safe zone. Also, no significant deviations in rotary trends were observed.
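
The optimization sweep described above, in which driller-controllable parameters are varied and the model's predicted ROP is compared, can be sketched as a simple grid search. This is an illustration under stated assumptions, not the paper's implementation: `toy_rop_model` is a hypothetical stand-in for a trained ROP model, and the parameter grids and units are invented for the example.

```python
from itertools import product


def toy_rop_model(wob, rpm):
    """Hypothetical stand-in for a trained ROP model; peaks at wob=30, rpm=120."""
    return 100.0 - 0.05 * (wob - 30.0) ** 2 - 0.002 * (rpm - 120.0) ** 2


def sweep(model, wob_values, rpm_values):
    """Evaluate the model on every (wob, rpm) pair and return the best pair and its ROP."""
    best = max(product(wob_values, rpm_values), key=lambda pair: model(*pair))
    return best, model(*best)


# Hypothetical operating limits for weight on bit and rotary speed.
wob_grid = range(10, 41, 5)     # 10, 15, ..., 40
rpm_grid = range(60, 181, 20)   # 60, 80, ..., 180

best_params, best_rop = sweep(toy_rop_model, wob_grid, rpm_grid)
print(best_params, best_rop)
```

A grid sweep is easy to constrain to the limits a driller can actually set, at the cost of evaluating the model once per grid point.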


2021 ◽  
pp. 1-21
Author(s):  
Hany Gamal ◽  
Ahmed Alsaihati ◽  
Salaheldin Elkatatny ◽  
Saleh Haidary ◽  
Abdulazeez Abdulraheem

Abstract The rock unconfined compressive strength (UCS) is one of the key parameters for geomechanical and reservoir modeling in the petroleum industry. Obtaining the UCS by conventional methods, such as experimental work or empirical correlation from logging data, is time-consuming and costly. To overcome these drawbacks, this paper used artificial intelligence (AI) to predict the rock strength in real time from the drilling parameters using two AI tools. Random forest (RF) based on principal component analysis (PCA), and functional network (FN) techniques were employed to build two UCS prediction models based on the drilling data such as weight on bit (WOB), drill string rotating-speed (RS), drilling torque (T), stand-pipe pressure (SPP), mud pumping rate (Q), and the rate of penetration (ROP). The models were built using 2,333 data points from well (A) with a 70:30 training to testing ratio. The models were validated using an unseen data set (1,300 data points) of Well (B), which is located in the same field and drilled across the same complex lithology. The PCA-based RF model outperformed the FN model in terms of correlation coefficient (R) and average absolute percentage error (AAPE). The overall accuracy for PCA-based RF was R of 0.99 and AAPE of 4.3 %, while FN yielded R of 0.97 and AAPE of 8.5%. The validation results showed that R was 0.99 for RF and 0.96 for FN, while the AAPE was 4 and 7.9 % for RF and FN models, respectively. The developed PCA-based RF and FN models provide an accurate UCS estimation in real-time from the drilling data, saving time and cost and enhancing the well stability by generating a UCS log from the rig drilling data.


2011 ◽  
Vol 20 (04) ◽  
pp. 753-781
Author(s):  
KAI CHEN ◽  
KIA MAKKI ◽  
NIKI PISSINOU

In the metropolitan region, most congestion or traffic jams are caused by the uneven distribution of traffic flow that creates bottleneck points where the traffic volume exceeds the road capacity. Additionally, unexpected incidents are the next most probable cause of these bottleneck regions. Moreover, most drivers are driving based on their empirical experience without awareness of real-time traffic situations. This unintelligent traffic behavior can make the congestion problem worse. Prediction based route guidance systems show great improvements in solving the inefficient diversion strategy problem by estimating future travel time when calculating accurate travel time is difficult. However, performances of machine learning based prediction models that are based on the historical data set degrade sharply during a congestion situation. This paper develops a new navigation system for reducing travel time of an individual driver and distributing the flow of urban traffic efficiently in order to reduce the occurrence of congestion. Compared with previous route guidance systems, the results reveal that our system, applying the advanced multi-lane prediction based real-time fastest path (AMPRFP) algorithm, can significantly reduce the travel time especially when drivers travel in a complex route environment and face frequent congestion problems. Unlike the previous system,1 it can be applied either for single lane or multi-lane urban traffic networks where the reason for congestion is significantly complex. We also demonstrate the advantages of this system and verify the results using real highway traffic data and a synthetic experiment.


This research describes how to use machine learning methods for real-time anomaly detection on a computer network. While using machine learning for this task is not a novel idea, little literature addresses doing it in real time. Most machine learning research on computer network anomaly detection relies on the KDD '99 data set and aims to demonstrate the efficiency of the algorithms introduced. The emphasis on this data set has led to a lack of scientific papers describing how to gather network data, extract features, and train algorithms for use in real-time networks. It has been argued that using the KDD '99 data set for anomaly detection is not appropriate for real-time network systems. This research proposes a data-gathering procedure that uses a dummy network and synthetic network traffic, and analyzes the importance of one-class SVM. Because the efficiency of k-means clustering and LSTM neural networks is lower than that of one-class SVM, this research uses existing results for LSTM and k-means clustering for comparison with reported outcomes of a similar algorithm on the KDD '99 data set. Specifically, by using synthetic network traffic rather than the KDD '99 data set, this research achieved higher accuracy than previous research.

