Groundwater pollution monitoring and the inverse problem of source identification. Evaluation of various Machine Learning methods

Author(s):  
Yiannis Kontos ◽  
Theodosios Kassandros ◽  
Konstantinos Katsifarakis ◽  
Kostas Karatzas

<p>Groundwater pollution numerical simulations coupled with Genetic Algorithms (GAs) lead to vast computational load, while flow fields’ simplification can compensate in design, but not real-time/operational, applications. Various Machine Learning/Deep Learning (ML/DL) methods/problem-formulations were tested/evaluated for real-time inverse problems of aquifer pollution source identification. Aim: investigate data-driven approaches towards replacing flow simulation with ML/DL trained models identifying the source, faster but efficiently enough.</p><p>Steady flow in a 1500mx1500m theoretical confined, isotropic aquifer of known characteristics is studied. Two pumping wells (PWs) near the southern boundary provide irrigation/drinking water, defining the flow together with a varying North-South natural flow. Six suspected possible sources, capable of instantaneous leakage, may spread a conservative pollutant. Particle tracking simulates advective mass transport, in a 2D flow-field for 2500 1-day timesteps. The 14x14 inner field grid nodes serve as locations of sources, PWs and monitoring wells (MWs; for simple daily yes/no pollution detection and/or drawdown measuring). 15,246 combinations of 6 Source Nrs, 21 N-S hydraulic gradients, 11+11 PW1,2 flow-rates were simulated with existing own software, providing the necessary data-sets for ML training/evaluation.</p><p>Two basic ML/DL approaches were implemented: Classification (CL) and Computer Vision (CV). In CL, every source is a discrete class, while each MW is a discrete variable. The target variable Y can equal 1 to 6, while input variables X can be: a) 0/1 (MW<sub>i</sub> polluted or not), b) the first day of MW<sub>i</sub>’s pollution, c) the duration of MW<sub>i</sub>’s pollution, d) hydraulic drawdown of MW<sub>i</sub>. For a bit more realism, the two southern rows of 28 MWs, and the MWs on/around PWs are concealed. CL features the advantage of facilitating Correlation-based Feature Subset Selection (CFSS), indirectly leading to a pseudo-optimization of the monitoring network, minimizing the number of MWs (not the sampling frequency though), based solely on the efficiency in identifying the source criterion. As a downside, time dimension and spatial correlation of MWs are not considered. Approach (b) being the best scheme, Random Forests (RFs; 86.5576% accuracy), Multi-Layer Perceptron (MLP; 77.5%), and Nearest Neighbors (NN; 86.5%) were tested. CFSS led to 8 only MWs being important, so training with the optimal subsets gave promising results: RF=85.4%, MLP=73.1%, NN=85.4%. In CV, MW<sub>i</sub>s’ pollution input data on a 10-day basis (0-60, 800-on concealed) were formulated into 14x14-pixel black/white images, that is 14x14 binary (0,1) matrices, the t=0 image being the desideratum. A Convolutional Neural Network (CNN; U-Net architecture for image segmentation) achieved 97.1% accuracy. A Convolutional Long/Short-Term Memory Neural Network (CLSTM), training a model to back-propagate predicting each given time step, with unchanged data formulation (60-800d, step 10), exhibits 82.3% accuracy. CLSTM’s performance is timestep-sensitive, best results yielded (98% accuracy) using configuration 5-800d, step 6.</p><p>Concluding, CL’s CFSS minimizes the input space, while CV approaches yield more promising results in terms of accuracy. Each approach has certain constraints in operational applicability, concerning the number of MWs, the sampling resolution and the total elapsed time. This process paves the way for realistic inverse problem solutions, ML-GAs monitoring network optimization, and real-time pollution detection operational systems. </p>

2019 ◽  
Vol 2019 ◽  
pp. 1-11 ◽  
Author(s):  
Xu Luo ◽  
Jun Yang

Detecting pollution timely and locating the pollution source is of great importance in environmental protection. Considering advantages of the sensor network technology, sensor networks have been adopted in pollution monitoring works. In this paper, a survey on researches of pollution monitoring using sensor networks in environment protection is given. Firstly, sensors and pollution monitoring network systems are studied. Secondly, different pollution detection methods are analyzed and compared. Thirdly, an overview of state-of-art technologies on pollution source localization is given. Finally, challenges on pollution monitoring using sensor networks are presented.


Geophysics ◽  
2020 ◽  
Vol 85 (6) ◽  
pp. R477-R492 ◽  
Author(s):  
Bingbing Sun ◽  
Tariq Alkhalifah

Full-waveform inversion (FWI) is a nonlinear optimization problem, and a typical optimization algorithm such as the nonlinear conjugate gradient or limited-memory Broyden-Fletcher-Goldfarb-Shanno (LBFGS) would iteratively update the model mainly along the gradient-descent direction of the misfit function or a slight modification of it. Based on the concept of meta-learning, rather than using a hand-designed optimization algorithm, we have trained the machine (represented by a neural network) to learn an optimization algorithm, entitled the “ML-descent,” and apply it in FWI. Using a recurrent neural network (RNN), we use the gradient of the misfit function as the input, and the hidden states in the RNN incorporate the history information of the gradient similar to an LBFGS algorithm. However, unlike the fixed form of the LBFGS algorithm, the machine-learning (ML) version evolves in response to the gradient. The loss function for training is formulated as a weighted summation of the L2 norm of the data residuals in the original inverse problem. As with any well-defined nonlinear inverse problem, the optimization can be locally approximated by a linear convex problem; thus, to accelerate the training, we train the neural network by minimizing randomly generated quadratic functions instead of performing time-consuming FWIs. To further improve the accuracy and robustness, we use a variational autoencoder that projects and represents the model in latent space. We use the Marmousi and the overthrust examples to demonstrate that the ML-descent method shows faster convergence and outperforms conventional optimization algorithms. The energy in the deeper part of the models can be recovered by the ML-descent even when the pseudoinverse of the Hessian is not incorporated in the FWI update.


Mathematics ◽  
2019 ◽  
Vol 7 (10) ◽  
pp. 890 ◽  
Author(s):  
Zhihao Zhang ◽  
Zhe Wu ◽  
David Rincon ◽  
Panagiotis Christofides

Machine learning has attracted extensive interest in the process engineering field, due to the capability of modeling complex nonlinear process behavior. This work presents a method for combining neural network models with first-principles models in real-time optimization (RTO) and model predictive control (MPC) and demonstrates the application to two chemical process examples. First, the proposed methodology that integrates a neural network model and a first-principles model in the optimization problems of RTO and MPC is discussed. Then, two chemical process examples are presented. In the first example, a continuous stirred tank reactor (CSTR) with a reversible exothermic reaction is studied. A feed-forward neural network model is used to approximate the nonlinear reaction rate and is combined with a first-principles model in RTO and MPC. An RTO is designed to find the optimal reactor operating condition balancing energy cost and reactant conversion, and an MPC is designed to drive the process to the optimal operating condition. A variation in energy price is introduced to demonstrate that the developed RTO scheme is able to minimize operation cost and yields a closed-loop performance that is very close to the one attained by RTO/MPC using the first-principles model. In the second example, a distillation column is used to demonstrate an industrial application of the use of machine learning to model nonlinearities in RTO. A feed-forward neural network is first built to obtain the phase equilibrium properties and then combined with a first-principles model in RTO, which is designed to maximize the operation profit and calculate optimal set-points for the controllers. A variation in feed concentration is introduced to demonstrate that the developed RTO scheme can increase operation profit for all considered conditions.


Sensors ◽  
2019 ◽  
Vol 19 (4) ◽  
pp. 804 ◽  
Author(s):  
Sagar Shelke ◽  
Baris Aksanli

Convergence of Machine Learning, Internet of Things, and computationally powerful single-board computers has boosted research and implementation of smart spaces. Smart spaces make predictions based on historical data to enhance user experience. In this paper, we present a low-cost, low-energy smart space implementation to detect static and dynamic human activities that require simple motions. We use low-resolution (4 × 16) and non-intrusive thermal sensors to collect data. We train six machine learning algorithms, namely logistic regression, naive Bayes, support vector machine, decision tree, random forest and artificial neural network (vanilla feed-forward) on the dataset collected in our lab. Our experiments reveal a very high static activity detection rate with all algorithms, where the feed-forward neural network method gives the best accuracy of 99.96%. We also show how data collection methods and sensor placement plays an important role in the resulting accuracy of different machine learning algorithms. To detect dynamic activities in real time, we use cross-correlation and connected components of thermal images. Our smart space implementation, with its real-time properties, can be used in various domains and applications, such as conference room automation, elderly health-care, etc.


Electronics ◽  
2021 ◽  
Vol 10 (2) ◽  
pp. 170
Author(s):  
Muhammad Wasimuddin ◽  
Khaled Elleithy ◽  
Abdelshakour Abuzneid ◽  
Miad Faezipour ◽  
Omar Abuzaghleh

Cardiovascular diseases have been reported to be the leading cause of mortality across the globe. Among such diseases, Myocardial Infarction (MI), also known as “heart attack”, is of main interest among researchers, as its early diagnosis can prevent life threatening cardiac conditions and potentially save human lives. Analyzing the Electrocardiogram (ECG) can provide valuable diagnostic information to detect different types of cardiac arrhythmia. Real-time ECG monitoring systems with advanced machine learning methods provide information about the health status in real-time and have improved user’s experience. However, advanced machine learning methods have put a burden on portable and wearable devices due to their high computing requirements. We present an improved, less complex Convolutional Neural Network (CNN)-based classifier model that identifies multiple arrhythmia types using the two-dimensional image of the ECG wave in real-time. The proposed model is presented as a three-layer ECG signal analysis model that can potentially be adopted in real-time portable and wearable monitoring devices. We have designed, implemented, and simulated the proposed CNN network using Matlab. We also present the hardware implementation of the proposed method to validate its adaptability in real-time wearable systems. The European ST-T database recorded with single lead L3 is used to validate the CNN classifier and achieved an accuracy of 99.23%, outperforming most existing solutions.


Atmosphere ◽  
2019 ◽  
Vol 10 (6) ◽  
pp. 318 ◽  
Author(s):  
Zihan Huang ◽  
Qi Yu ◽  
Yujie Liu ◽  
Weichun Ma ◽  
Limin Chen

Dense air quality monitoring network (AQMN) is one of main ways to surveil industrial air pollution. This paper is concerned with the design of a dense AQMN for H2S for a chemical industrial park in Shanghai, China. An indicator (Surveillance Efficiency, SE) for the long-term performance of AQMN was constructed by averaging pollution detection efficiency (rd) and source identification efficiency (rb). A ranking method was developed by combing Gaussian puff model and Source area analysis for improving calculation efficiency. Candidate combinations with highest score were given priority in the selection of next site. Two existing monitors were suggested to relocate to the west and southwest of this park. SE of optimized AQMN increased quickly with monitor number, and then the growth trend started to flatten when the number reached about 60. The highest SE occurred when the number reached 110. Optimal schemes of AQMNs were suggested which can achieve about 98% of the highest SE, while using only about 60 monitors. Finally, the reason why the highest SE is less than 1 and the variation characteristics of rd and rb were discussed. Overall, the proposed method is an effective tool for designing AQMN with optimal SE in industrial parks.


2020 ◽  
Vol 16 (4) ◽  
Author(s):  
Rohit Verma ◽  
Saumil Maheshwari ◽  
Anupam Shukla

AbstractObjectivesThe appropriate care for patients admitted in Intensive care units (ICUs) is becoming increasingly prominent, thus recognizing the use of machine learning models. The real-time prediction of mortality of patients admitted in ICU has the potential for providing the physician with the interpretable results. With the growing crisis including soaring cost, unsafe care, misdirected care, fragmented care, chronic diseases and evolution of epidemic diseases in the domain of healthcare demands the application of automated and real-time data processing for assuring the improved quality of life. The intensive care units (ICUs) are responsible for generating a wealth of useful data in the form of Electronic Health Record (EHR). This data allows for the development of a prediction tool with perfect knowledge backing.MethodWe aimed to build the mortality prediction model on 2012 Physionet Challenge mortality prediction database of 4,000 patients admitted in ICU. The challenges in the dataset, such as high dimensionality, imbalanced distribution and missing values, were tackled with analytical methods and tools via feature engineering and new variable construction. The objective of the research is to utilize the relations among the clinical variables and construct new variables which would establish the effectiveness of 1-Dimensional Convolutional Neural Network (1-D CNN) with constructed features.ResultsIts performance with the traditional machine learning algorithms like XGBoost classifier, Light Gradient Boosting Machine (LGBM) classifier, Support Vector Machine (SVM), Decision Tree (DT), K-Neighbours Classifier (K-NN), and Random Forest Classifier (RF) and recurrent models like Long Short-Term Memory (LSTM) and LSTM-attention is compared for Area Under Curve (AUC). The investigation reveals the best AUC of 0.848 using 1-D CNN model.ConclusionThe relationship between the various features were recognized. Also, constructed new features using existing ones. Multiple models were tested and compared on different metrics.


Sign in / Sign up

Export Citation Format

Share Document