Spatial Prediction of Groundwater Potentiality Mapping Using Machine Learning Algorithms

Abstract Machine learning techniques offer powerful tools for the assessment and management of groundwater resources. Here, we evaluated the groundwater potential maps (GWPMs) in Md. Bazar Block of Birbhum District, India using four GIS-based machine-learning algorithms (MLA) such as predictive neural network (PNN), decision tree (DT), Naïve Bayes classifier (NBC), and random forest (RF). We used a database of 85 dug wells and one piezometer location identified using extensive field study, and employed 12 influencing factors (elevation, slope, drainage density (DD), topographical wetness index, geomorphology, lineament density, rainfall, geology, pond density, land use/land cover (LULC), geology, and soil texture) for evaluation through GIS. The 85 dug wells and 1 piezometer locations were sub-divided into two classes: 70:30 for training and model validation. The DT, RF, PNN, and NBC MLAs were implemented to analyse the relationship between the dug well locations and groundwater influencing factors to generate GWPMs. The results predict excellent groundwater potential areas (GPA) DT RF of 17.38%, 14.69%, 20.43%, and 13.97% of the study area, respectively. The prediction accuracy of each GWPM was determined using a receiver operating characteristic (ROC) curve. Using the 30% data sets (validation data), accuracies of 80.1%, 78.30%, 75.20%, and 69.2% were obtained for the PNN, RF, DT, and NBC models, respectively. The ROC values show that the four implemented models provide satisfactory and suitable results for GWP mapping in this region. In addition, the well-known mean decrease Gini (MDG) from the RF MLA was implemented to determine the relative importance of the variables for groundwater potentiality assessment. The MDG revealed that drainage density, lineament density, geomorphology, pond density, elevation, and stream junction frequency were the most useful determinants of GWPM. Our approach to delineate the GWPM can aid in the effective planning and management of groundwater resources in this region.

Download Full-text

Integrated Geospatial Multi Influencing Factor Approach to Delineate and Identify Groundwater Potential Zones in Kabul Province, Afghanistan

10.21203/rs.3.rs-211720/v1 ◽

2021 ◽

Author(s):

Muhammad Jamal Nasir ◽

Sajjad Khan ◽

Tehreem Ayaz ◽

Amir Zeb Khan ◽

Waqas Ahmad ◽

...

Keyword(s):

Groundwater Resources ◽

Influencing Factor ◽

Relative Weight ◽

Drainage Density ◽

Groundwater Potential ◽

Land Use Land Cover ◽

Groundwater Potential Zones ◽

Lineament Density ◽

Study Results ◽

Groundwater Potentiality

Abstract This study was an attempt to evaluate the groundwater potentiality in Kabul province, Afghanistan using geospatial multi influencing factor (MIF) approach. The influencing parameters employed for the assessment of groundwater potential zones (GWPZ) includes slope, geology, soil, land use/land cover, lineament density, rainfall and drainage density. The sub-classes within each influencing parameter were sub-divided, based on their effectiveness in groundwater potentiality as major, minor and no effect, and subsequently assigned a score value. The combined score value of these parameters was used for calculating the relative weight. The delineated GWPZ were classified in four groups, i.e. poor, moderate, good and very good GWPZ. The study results revealed that very good GWPZ covered an area of 354.87km2 (2% of the total area), good 1523.86 km2 (20%), moderate 2250.99 km2 (73%) and poor 477.19 km2 (5%). The study concluded that geospatial assisted MIF approach was very useful and efficient techniques for the assessment of GWPZ and can be effectively employed to enhance the conceptual understanding of groundwater resources of Kabul Basin, Afghanistan.

Download Full-text

Developing groundwater potentiality models by coupling ensemble machine learning algorithms and statistical techniques for sustainable groundwater management

Geocarto International ◽

10.1080/10106049.2021.1987535 ◽

2021 ◽

pp. 1-21

Author(s):

Javed Mallick ◽

Mohd Waseem Naikoo ◽

Swapan Talukdar ◽

Ishita Afreen Ahmed ◽

Atiqur Rahman ◽

...

Keyword(s):

Machine Learning ◽

Groundwater Management ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Statistical Techniques ◽

Sustainable Groundwater Management ◽

Ensemble Machine Learning ◽

Groundwater Potentiality

Download Full-text

Integration of statistical models and ensemble machine learning algorithms (MLAs) for developing the novel hybrid groundwater potentiality models: a case study of semi-arid watershed in Saudi Arabia

Geocarto International ◽

10.1080/10106049.2021.1939439 ◽

2021 ◽

pp. 1-35

Author(s):

Javed Mallick ◽

Swapan Talukdar ◽

Majed Alsubih ◽

Mohammed K. Almesfer ◽

Shahfahad ◽

...

Keyword(s):

Machine Learning ◽

Saudi Arabia ◽

Statistical Models ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

The Novel ◽

Ensemble Machine Learning ◽

Groundwater Potentiality ◽

Semi Arid

Download Full-text

Multivariate Analysis and Machine Learning Approach for Mapping the Variability and Vulnerability of Urban Flooding: The Case of Tangier City, Morocco

Hydrology ◽

10.3390/hydrology8040182 ◽

2021 ◽

Vol 8 (4) ◽

pp. 182

Author(s):

Tarik Bouramtane ◽

Ilias Kacimi ◽

Khalil Bouramtane ◽

Maryam Aziz ◽

Shiny Abraham ◽

...

Keyword(s):

Machine Learning ◽

Multivariate Analysis ◽

Population Density ◽

Learning Algorithms ◽

Drainage Density ◽

Machine Learning Algorithms ◽

Support Vector ◽

Urban Flooding ◽

Flood Vulnerability ◽

Urban Flood

Urban flooding is a complex natural hazard, driven by the interaction between several parameters related to urban development in a context of climate change, which makes it highly variable in space and time and challenging to predict. In this study, we apply a multivariate analysis method (PCA) and four machine learning algorithms to investigate and map the variability and vulnerability of urban floods in the city of Tangier, northern Morocco. Thirteen parameters that could potentially affect urban flooding were selected and divided into two categories: geo-environmental parameters and socio-economic parameters. PCA processing allowed identifying and classifying six principal components (PCs), totaling 73% of the initial information. The scores of the parameters on the PCs and the spatial distribution of the PCs allow to highlight the interconnection between the topographic properties and urban characteristics (population density and building density) as the main source of variability of flooding, followed by the relationship between the drainage (drainage density and distance to channels) and urban properties. All four machine learning algorithms show excellent performance in predicting urban flood vulnerability (ROC curve > 0.9). The Classifications and Regression Tree and Support Vector Machine models show the best prediction performance (ACC = 91.6%). Urban flood vulnerability maps highlight, on the one hand, low lands with a high drainage density and recent buildings, and on the other, higher, steep-sloping areas with old buildings and a high population density, as areas of high to very-high vulnerability.

Download Full-text

Application of Probabilistic and Machine Learning Models for Groundwater Potentiality Mapping in Damghan Sedimentary Plain, Iran

Remote Sensing ◽

10.3390/rs11243015 ◽

2019 ◽

Vol 11 (24) ◽

pp. 3015 ◽

Cited By ~ 7

Author(s):

Alireza Arabameri ◽

Jagabandhu Roy ◽

Sunil Saha ◽

Thomas Blaschke ◽

Omid Ghorbanzadeh ◽

...

Keyword(s):

Machine Learning ◽

Sensitivity Analysis ◽

Prediction Accuracy ◽

Goodness Of Fit ◽

Drainage Density ◽

Groundwater Potential ◽

Plain Area ◽

Groundwater Occurrence ◽

Groundwater Potentiality ◽

Very High

Groundwater is one of the most important natural resources, as it regulates the earth’s hydrological system. The Damghan sedimentary plain area, located in the region of a semi-arid climate of Iran, has very critical conditions of groundwater due to massive pressure on it and is in need of robust models for identifying the groundwater potential zones (GWPZ). The main goal of the current research is to prepare a groundwater potentiality map (GWPM) considering the probabilistic, machine learning, data mining, and multi-criteria decision analysis (MCDA) approaches. For this purpose, 80 wells collected from the Iranian groundwater resource department and field investigation with global positioning system (GPS), have been selected randomly and considered as the groundwater inventory datasets. Out of 80 wells, 56 (70%) wells have been brought into play for modeling and 24 (30%) for validation purposes. Elevation, slope, aspect, convergence index (CI), rainfall, drainage density (Dd), distance to river, distance to fault, distance to road, lithology, soil type, land use/land cover (LU/LC), normalized difference vegetation index (NDVI), topographic wetness index (TWI), topographic position index (TPI), and stream power index (SPI) have been used for modeling purpose. The area under the receiver operating characteristic (AUROC), sensitivity (SE), specificity (SP), accuracy (AC), mean absolute error (MAE), and root mean square error (RMSE) are used for checking the goodness-of-fit and prediction accuracy of approaches to compare their performance. In addition, the influence of groundwater determining factors (GWDFs) on groundwater occurrence was evaluated by performing a sensitivity analysis model. The GWPMs, produced by technique for order preference by similarity to ideal solution (TOPSIS), random forest (RF), binary logistic regression (BLR), weight of evidence (WoE) and support vector machine (SVM) have been classified into four categories, i.e., low, medium, high and very high groundwater potentiality with the help of the natural break classification methods in the GIS environment. The very high groundwater potentiality class is covered 15.09% for TOPSIS, 15.46% for WoE, 25.26% for RF, 15.47% for BLR, and 18.74% for SVM of the entire plain area. Based on sensitivity analysis, distance from river, and drainage density represent significantly effects on the groundwater occurrence. validation results show that the BLR model with best prediction accuracy and goodness-of-fit outperforms the other five models. Although, all models have very good performance in modeling of groundwater potential. Results of seed cell area index model that used for checking accuracy classification of models show that all models have suitable performance. Therefore, these are promising models that can be applied for the GWPZs identification, which will help for some needful action of these areas.

Download Full-text

Supplemental Material for One Model to Rule Them All? Using Machine Learning Algorithms to Determine the Number of Factors in Exploratory Factor Analysis

Psychological Methods ◽

10.1037/met0000262.supp ◽

2020 ◽

Keyword(s):

Machine Learning ◽

Factor Analysis ◽

Exploratory Factor Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Number Of Factors

Download Full-text

Forecasting US movies box office performances in Turkey using machine learning algorithms

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189120 ◽

2020 ◽

Vol 39 (5) ◽

pp. 6579-6590

Author(s):

Sandy Çağlıyor ◽

Başar Öztayşi ◽

Selime Sezgin

Keyword(s):

Machine Learning ◽

Global Economy ◽

Learning Algorithms ◽

Forecast Model ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

High Stakes ◽

Box Office ◽

Industry Forecast ◽

The Impact

The motion picture industry is one of the largest industries worldwide and has significant importance in the global economy. Considering the high stakes and high risks in the industry, forecast models and decision support systems are gaining importance. Several attempts have been made to estimate the theatrical performance of a movie before or at the early stages of its release. Nevertheless, these models are mostly used for predicting domestic performances and the industry still struggles to predict box office performances in overseas markets. In this study, the aim is to design a forecast model using different machine learning algorithms to estimate the theatrical success of US movies in Turkey. From various sources, a dataset of 1559 movies is constructed. Firstly, independent variables are grouped as pre-release, distributor type, and international distribution based on their characteristic. The number of attendances is discretized into three classes. Four popular machine learning algorithms, artificial neural networks, decision tree regression and gradient boosting tree and random forest are employed, and the impact of each group is observed by compared by the performance models. Then the number of target classes is increased into five and eight and results are compared with the previously developed models in the literature.

Download Full-text

Intelligent system of English composition scoring model based on improved machine learning algorithm

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189235 ◽

2020 ◽

pp. 1-11

Author(s):

Jie Liu ◽

Lin Lin ◽

Xiufang Liang

Keyword(s):

Machine Learning ◽

Evaluation System ◽

Intelligent System ◽

Learning Algorithm ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Assessment System ◽

English Composition ◽

Region Extraction ◽

Constraint Model

The online English teaching system has certain requirements for the intelligent scoring system, and the most difficult stage of intelligent scoring in the English test is to score the English composition through the intelligent model. In order to improve the intelligence of English composition scoring, based on machine learning algorithms, this study combines intelligent image recognition technology to improve machine learning algorithms, and proposes an improved MSER-based character candidate region extraction algorithm and a convolutional neural network-based pseudo-character region filtering algorithm. In addition, in order to verify whether the algorithm model proposed in this paper meets the requirements of the group text, that is, to verify the feasibility of the algorithm, the performance of the model proposed in this study is analyzed through design experiments. Moreover, the basic conditions for composition scoring are input into the model as a constraint model. The research results show that the algorithm proposed in this paper has a certain practical effect, and it can be applied to the English assessment system and the online assessment system of the homework evaluation system algorithm system.

Download Full-text

The Unlearnable Checkerboard Pattern

Communications of the Blyth Institute ◽

10.33014/issn.2640-5652.1.2.holloway.1 ◽

2019 ◽

Vol 1 (2) ◽

pp. 78-80

Author(s):

Eric Holloway

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Checkerboard Pattern ◽

Simple Task

Detecting some patterns is a simple task for humans, but nearly impossible for current machine learning algorithms. Here, the "checkerboard" pattern is examined, where human prediction nears 100% and machine prediction drops significantly below 50%.

Download Full-text