Machine Learning Analysis of Hydrologic Exchange Flows and Transit Time Distributions in a Large Regulated River

Frontiers in Artificial Intelligence ◽

10.3389/frai.2021.648071 ◽

2021 ◽

Vol 4 ◽

Author(s):

Huiying Ren ◽

Xuehang Song ◽

Yilin Fang ◽

Z. Jason Hou ◽

Timothy D. Scheibe

Keyword(s):

Machine Learning ◽

Transit Time ◽

Numerical Models ◽

Data Availability ◽

Regulated River ◽

Exchange Flows ◽

Extreme Gradient Boosting ◽

Hydrologic Exchange ◽

River Corridor ◽

River Velocity

Hydrologic exchange between river channels and adjacent subsurface environments is a key process that influences water quality and ecosystem function in river corridors. High-resolution numerical models are often used to resolve the spatial and temporal variations of exchange flows, but they are computationally expensive. In this study, we adopt Random Forest (RF) and Extreme Gradient Boosting (XGB) approaches for deriving reduced-order models of hydrologic exchange flows and associated transit time distributions, integrating field observations (e.g., bathymetry) and hydrodynamic simulation data (e.g., river velocity, depth). The setup allows an improved understanding of the influences of various physical, spatial, and temporal factors on the hydrologic exchange flows and transit times. The predictors also include those derived using hybrid clustering, leveraging our previous work on river corridor system hydromorphic classification. The machine learning-based predictive models are developed and validated along the Columbia River Corridor, and the results show that the top parameters are the thickness of the top geological formation layer, the flow regime, river velocity, and river depth; the RF and XGB models can achieve 70% to 80% accuracy and are therefore effective alternatives to the computationally demanding numerical models of exchange flows and transit time distributions. Each machine learning model has been evaluated with its favorable configuration and setup. The transferability of the models to other river reaches and larger scales, which mostly depends on data availability, is also discussed.
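
The idea of fitting a boosted tree ensemble and then ranking predictors can be illustrated with a minimal sketch. This is plain least-squares gradient boosting with one-split trees, not the RF or XGB configuration used in the study (XGBoost additionally uses regularization and second-order gradient information). The data, the feature names `velocity` and `noise`, and all function names are hypothetical.

```python
# Toy least-squares gradient boosting with one-split trees ("stumps").
# All data and feature names are hypothetical, not taken from the study.

def fit_stump(X, residuals):
    """Return (feature, threshold, left_mean, right_mean, gain) of the best split."""
    mean_all = sum(residuals) / len(residuals)
    sse_parent = sum((r - mean_all) ** 2 for r in residuals)
    best = None
    for j in range(len(X[0])):
        # Candidate thresholds: every distinct value except the largest,
        # so both sides of the split are non-empty.
        for thr in sorted({row[j] for row in X})[:-1]:
            left = [r for row, r in zip(X, residuals) if row[j] <= thr]
            right = [r for row, r in zip(X, residuals) if row[j] > thr]
            lm = sum(left) / len(left)
            rm = sum(right) / len(right)
            sse = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            gain = sse_parent - sse  # reduction in squared error from this split
            if best is None or gain > best[4]:
                best = (j, thr, lm, rm, gain)
    return best


def fit_boosting(X, y, n_rounds=30, learning_rate=0.3):
    """Fit stumps sequentially to the residuals of the current prediction."""
    base = sum(y) / len(y)
    pred = [base] * len(y)
    stumps = []
    importance = [0.0] * len(X[0])  # accumulated gain per feature
    for _ in range(n_rounds):
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        j, thr, lm, rm, gain = fit_stump(X, residuals)
        stumps.append((j, thr, lm, rm))
        importance[j] += gain
        pred = [pi + learning_rate * (lm if row[j] <= thr else rm)
                for row, pi in zip(X, pred)]
    return {"base": base, "stumps": stumps, "lr": learning_rate,
            "importance": importance}


def predict(model, row):
    """Sum the base value and the shrunken contribution of every stump."""
    return model["base"] + sum(
        model["lr"] * (lm if row[j] <= thr else rm)
        for j, thr, lm, rm in model["stumps"])


# Hypothetical predictors: column 0 = "velocity", column 1 = "noise".
X = [[0.1, 5], [0.2, 1], [0.3, 4], [0.4, 2],
     [0.5, 3], [0.6, 5], [0.7, 1], [0.8, 4]]
y = [1.0, 1.1, 0.9, 1.0, 3.0, 3.1, 2.9, 3.0]  # depends on column 0 only

model = fit_boosting(X, y)
mean_y = sum(y) / len(y)
baseline_mse = sum((yi - mean_y) ** 2 for yi in y) / len(y)
model_mse = sum((predict(model, row) - yi) ** 2 for row, yi in zip(X, y)) / len(y)
print("gain-based importance:", model["importance"])
print("baseline MSE:", baseline_mse, "boosted MSE:", model_mse)
```

Because the toy target depends only on the first column, almost all accumulated gain is attributed to it. Gain-based ranking is one common way tree ensembles report influential predictors; the abstract does not state which importance measure the study used.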

Scale-dependent spatial variabilities of hydrological exchange flows and transit time in a large regulated river

Journal of Hydrology ◽

10.1016/j.jhydrol.2021.126283 ◽

2021 ◽

Vol 598 ◽

pp. 126283

Author(s):

Xuehang Song ◽

Yilin Fang ◽

Jie Bao ◽

Huiying Ren ◽

Zhuoran Duan ◽

...

Keyword(s):

Transit Time ◽

Regulated River ◽

Exchange Flows ◽

Hydrological Exchange

Dam Operations and Subsurface Hydrogeology Control Dynamics of Hydrologic Exchange Flows in a Regulated River Reach

10.1002/essoar.10500054.1 ◽

2018 ◽

Cited By ~ 1

Author(s):

Pin Shuai ◽

Xingyuan Chen ◽

Xuehang Song ◽

Glenn Hammond ◽

John Zachara ◽

...

Keyword(s):

Regulated River ◽

Exchange Flows ◽

River Reach ◽

Hydrologic Exchange

River Dynamics Control Transit Time Distributions and Biogeochemical Reactions in a Dam‐Regulated River Corridor

Water Resources Research ◽

10.1029/2019wr026470 ◽

2020 ◽

Vol 56 (9) ◽

Cited By ~ 1

Author(s):

Xuehang Song ◽

Xingyuan Chen ◽

John M. Zachara ◽

Jesus D. Gomez‐Velez ◽

Pin Shuai ◽

...

Keyword(s):

Transit Time ◽

Regulated River ◽

River Dynamics ◽

River Corridor

Dam Operations and Subsurface Hydrogeology Control Dynamics of Hydrologic Exchange Flows in a Regulated River Reach

Water Resources Research ◽

10.1029/2018wr024193 ◽

2019 ◽

Vol 55 (4) ◽

pp. 2593-2612 ◽

Cited By ~ 7

Author(s):

Pin Shuai ◽

Xingyuan Chen ◽

Xuehang Song ◽

Glenn E. Hammond ◽

John Zachara ◽

...

Keyword(s):

Regulated River ◽

Exchange Flows ◽

River Reach ◽

Hydrologic Exchange

Kilometer‐Scale Hydrologic Exchange Flows in a Gravel Bed River Corridor and Their Implications to Solute Migration

Water Resources Research ◽

10.1029/2019wr025258 ◽

2020 ◽

Vol 56 (2) ◽

Cited By ~ 4

Author(s):

John M. Zachara ◽

Xingyuan Chen ◽

Xuehang Song ◽

Pin Shuai ◽

Chris Murray ◽

...

Keyword(s):

Exchange Flows ◽

Gravel Bed ◽

Gravel Bed River ◽

Hydrologic Exchange ◽

River Corridor

Predicting Undesired Treatment Outcome in Mental Healthcare: Machine Learning Study (Preprint)

10.2196/preprints.17235 ◽

2019 ◽

Author(s):

Kasper Van Mens ◽

Joran Lokkerbol ◽

Richard Janssen ◽

Robert de Lange ◽

Bea Tiemens

Keyword(s):

Machine Learning ◽

Treatment Outcome ◽

Mental Health Treatment ◽

Mental Healthcare ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

Trade Off ◽

Trade Offs ◽

Outcome Monitoring ◽

Extreme Gradient Boosting

BACKGROUND It remains a challenge to predict which treatment will work for which patient in mental healthcare. OBJECTIVE In this study we compare machine learning algorithms to predict during treatment which patients will not benefit from brief mental health treatment and present trade-offs that must be considered before an algorithm can be used in clinical practice. METHODS Using an anonymized dataset containing routine outcome monitoring data from a mental healthcare organization in the Netherlands (n = 2,655), we applied three machine learning algorithms to predict treatment outcome. The algorithms were internally validated with cross-validation on a training sample (n = 1,860) and externally validated on an unseen test sample (n = 795). RESULTS The performance of the three algorithms did not significantly differ on the test set. With a default classification cut-off at 0.5 predicted probability, the extreme gradient boosting algorithm showed the highest positive predictive value (ppv) of 0.71 (0.61 – 0.77) with a sensitivity of 0.35 (0.29 – 0.41) and area under the curve of 0.78. A trade-off can be made between ppv and sensitivity by choosing different cut-off probabilities. With a cut-off at 0.63, the ppv increased to 0.87 and the sensitivity dropped to 0.17. With a cut-off at 0.38, the ppv decreased to 0.61 and the sensitivity increased to 0.57. CONCLUSIONS Machine learning can be used to predict treatment outcomes based on routine monitoring data. This allows practitioners to choose their own trade-off between being selective and more certain versus inclusive and less certain.
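
The trade-off between ppv and sensitivity described above can be sketched in a few lines. The labels and predicted probabilities below are hypothetical toy values, not the study's data, and `ppv_and_sensitivity` is an illustrative helper name.

```python
def ppv_and_sensitivity(y_true, y_prob, cutoff):
    """Return (ppv, sensitivity) when cases with y_prob >= cutoff are flagged positive."""
    tp = sum(1 for t, p in zip(y_true, y_prob) if p >= cutoff and t == 1)
    fp = sum(1 for t, p in zip(y_true, y_prob) if p >= cutoff and t == 0)
    fn = sum(1 for t, p in zip(y_true, y_prob) if p < cutoff and t == 1)
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    return ppv, sensitivity


# Hypothetical toy data: 1 = undesired outcome, 0 = otherwise.
y_true = [1, 1, 1, 1, 0, 1, 0, 0, 1, 0]
y_prob = [0.95, 0.80, 0.70, 0.60, 0.55, 0.45, 0.40, 0.30, 0.20, 0.10]

results = {c: ppv_and_sensitivity(y_true, y_prob, c) for c in (0.35, 0.50, 0.65)}
for cutoff, (ppv, sens) in sorted(results.items()):
    print(f"cut-off {cutoff:.2f}: ppv={ppv:.2f}, sensitivity={sens:.2f}")
```

In this toy example, raising the cut-off flags fewer cases but a larger share of them are true positives, while lowering it catches more true positives at the cost of more false alarms. That is the same direction of trade-off the abstract reports.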

Evaluation of Three Different Machine Learning Methods for Object-Based Artificial Terrace Mapping—A Case Study of the Loess Plateau, China

Remote Sensing ◽

10.3390/rs13051021 ◽

2021 ◽

Vol 13 (5) ◽

pp. 1021

Author(s):

Hu Ding ◽

Jiaming Na ◽

Shangjing Jiang ◽

Jie Zhu ◽

Kai Liu ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Loess Plateau ◽

Water Conservation ◽

Nearest Neighbor ◽

Gradient Boosting ◽

K Nearest Neighbor ◽

The Loess Plateau ◽

Object Based ◽

Extreme Gradient Boosting

Artificial terraces are of great importance for agricultural production and soil and water conservation. Automatic high-accuracy mapping of artificial terraces is the basis of monitoring and related studies. Previous research achieved artificial terrace mapping based on high-resolution digital elevation models (DEMs) or imagery. Because contextual information is important for terrace mapping, object-based image analysis (OBIA) combined with machine learning (ML) technologies is widely used. However, the selection of an appropriate classifier is of great importance for the terrace mapping task. In this study, the performance of an integrated framework using OBIA and ML for terrace mapping was tested. A catchment, Zhifanggou, in the Loess Plateau, China, was used as the study area. First, optimized image segmentation was conducted. Then, features from the DEMs and imagery were extracted, and the correlations between the features were analyzed and ranked for classification. Finally, three commonly used ML classifiers, namely, extreme gradient boosting (XGBoost), random forest (RF), and k-nearest neighbor (KNN), were used for terrace mapping. The comparison with the ground truth, as delineated by field survey, indicated that random forest performed best, with a 95.60% overall accuracy (followed by 94.16% and 92.33% for XGBoost and KNN, respectively). The influence of class imbalance and feature selection is discussed. This work provides a credible framework for mapping artificial terraces.

A Machine Learning Method for Predicting Vegetation Indices in China

Remote Sensing ◽

10.3390/rs13061147 ◽

2021 ◽

Vol 13 (6) ◽

pp. 1147

Author(s):

Xiangqian Li ◽

Wenping Yuan ◽

Wenjie Dong

Keyword(s):

Machine Learning ◽

Growing Season ◽

Crop Growth ◽

Spatiotemporal Distribution ◽

Coefficient Of Determination ◽

Gradient Boosting ◽

Severe Drought ◽

Vegetation Growth ◽

Extreme Gradient Boosting ◽

Boosting Method

To forecast the terrestrial carbon cycle and monitor food security, vegetation growth must be accurately predicted; however, current process-based ecosystem and crop-growth models are limited in their effectiveness. This study developed a machine learning model using the extreme gradient boosting method to predict vegetation growth throughout the growing season in China from 2001 to 2018. The model used satellite-derived vegetation data for the first month of each growing season, CO2 concentration, and several meteorological factors as data sources for the explanatory variables. Results showed that the model could reproduce the spatiotemporal distribution of vegetation growth as represented by the satellite-derived normalized difference vegetation index (NDVI). The predictive error for the growing season NDVI was less than 5% for more than 98% of vegetated areas in China; the model represented seasonal variations in NDVI well. The coefficient of determination (R2) between the monthly observed and predicted NDVI was 0.83, and more than 69% of vegetated areas had an R2 > 0.8. The effectiveness of the model was examined for a severe drought year (2009), and results showed that the model could reproduce the spatiotemporal distribution of NDVI even under extreme conditions. This model provides an alternative method for predicting vegetation growth and has great potential for monitoring vegetation dynamics and crop growth.
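
For readers unfamiliar with the two metrics quoted above, the following sketch computes a coefficient of determination and a mean relative error for a pair of observed and predicted series. It assumes the common definition R2 = 1 - SS_res / SS_tot; the abstract does not say whether the study used this form or the squared correlation coefficient. The NDVI values are hypothetical.

```python
def r_squared(observed, predicted):
    """Coefficient of determination, defined here as 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def mean_relative_error(observed, predicted):
    """Mean of |predicted - observed| / |observed| over all samples."""
    return sum(abs(p - o) / abs(o) for o, p in zip(observed, predicted)) / len(observed)


# Hypothetical monthly NDVI values for one pixel (not from the study).
observed_ndvi = [0.2, 0.4, 0.6, 0.8]
predicted_ndvi = [0.25, 0.35, 0.65, 0.75]

r2 = r_squared(observed_ndvi, predicted_ndvi)
mre = mean_relative_error(observed_ndvi, predicted_ndvi)
print(f"R2 = {r2:.3f}, mean relative error = {mre:.1%}")
```

Applied per pixel, a calculation like this would yield maps of R2 and relative error from which area fractions such as "more than 69% of vegetated areas had an R2 > 0.8" could be derived.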

Corn Nitrogen Status Diagnosis with an Innovative Multi-Parameter Crop Circle Phenom Sensing System

Remote Sensing ◽

10.3390/rs13030401 ◽

2021 ◽

Vol 13 (3) ◽

pp. 401

Author(s):

Cadan Cummings ◽

Yuxin Miao ◽

Gabriel Dias Paiao ◽

Shujiang Kang ◽

Fabián G. Fernández

Keyword(s):

Machine Learning ◽

Chlorophyll Content ◽

Vegetation Index ◽

Soil Drainage ◽

Management Information ◽

Area Index ◽

Sensing System ◽

Extreme Gradient Boosting ◽

Split Plot ◽

N Status

Accurate and non-destructive in-season crop nitrogen (N) status diagnosis is important for the success of precision N management (PNM). Several active canopy sensors (ACS) with two or three spectral wavebands have been used for this purpose. The Crop Circle Phenom sensor is a new integrated multi-parameter proximal ACS system for in-field plant phenomics with the capability to measure reflectance, structural, and climatic attributes. The objective of this study was to evaluate this multi-parameter Crop Circle Phenom sensing system for in-season diagnosis of corn (Zea mays L.) N status across different soil drainage and tillage systems under variable N supply conditions. The four plant metrics used to approximate in-season N status consist of aboveground biomass (AGB), plant N concentration (PNC), plant N uptake (PNU), and N nutrition index (NNI). A field experiment was conducted in Wells, Minnesota during the 2018 and the 2019 growing seasons with a split-split plot design replicated four times with soil drainage (drained and undrained) as main block, tillage (conventional, no-till, and strip-till) as split plot, and pre-plant N (PPN) rate (0 to 225 kg ha−1 in 45 kg ha−1 increments) as the split-split plot. Crop Circle Phenom measurements alongside destructive whole plant samples were collected at V8 +/−1 growth stage. Proximal sensor metrics were used to construct regression models to estimate N status indicators using simple regression (SR) and eXtreme Gradient Boosting (XGB) models. The sensor derived indices tested included normalized difference vegetation index (NDVI), normalized difference red edge (NDRE), estimated canopy chlorophyll content (eCCC), estimated leaf area index (eLAI), ratio vegetation index (RVI), canopy chlorophyll content index (CCCI), fractional photosynthetically active radiation (fPAR), and canopy and air temperature difference (ΔTemp).
Management practices such as drainage, tillage, and PPN rate were also included to determine the potential improvement in corn N status diagnosis. Three of the four replicated drained and undrained blocks were randomly selected as training data, and the remaining drained and undrained blocks were used as testing data. The results indicated that SR modeling using NDVI would be sufficient for estimating AGB compared to more complex machine learning methods. Conversely, PNC, PNU, and NNI all benefitted from XGB modeling based on multiple inputs. Among different approaches of XGB modeling, combining management information and Crop Circle Phenom measurements together increased model performance for predicting each of the four plant N metrics compared with solely using sensing data. The PPN rate was the most important management metric for all models compared to drainage and tillage information. Combining Crop Circle Phenom sensor parameters and management information is a promising strategy for in-season diagnosis of corn N status. More studies are needed to further evaluate this new integrated sensing system under diverse on-farm conditions and to test other machine learning models.

Machine Learning Approach for Predicting Lane-Change Maneuvers using the SHRP2 Naturalistic Driving Study Data

Transportation Research Record Journal of the Transportation Research Board ◽

10.1177/03611981211003581 ◽

2021 ◽

pp. 036119812110035

Author(s):

Anik Das ◽

Mohamed M. Ahmed

Keyword(s):

Machine Learning ◽

Prediction Accuracy ◽

Machine Learning Algorithms ◽

Support Vector ◽

Lane Change ◽

Adaptive Boosting ◽

Extreme Gradient Boosting ◽

Naturalistic Driving Study ◽

Naturalistic Driving ◽

Change Prediction

Accurate lane-change prediction information in real time is essential to safely operate Autonomous Vehicles (AVs) on the roadways, especially at the early stage of AV deployment, where there will be an interaction between AVs and human-driven vehicles. This study proposed reliable lane-change prediction models considering features from vehicle kinematics, machine vision, driver, and roadway geometric characteristics using the trajectory-level SHRP2 Naturalistic Driving Study and Roadway Information Database. Several machine learning algorithms were trained, validated, tested, and comparatively analyzed, including Classification And Regression Trees (CART), Random Forest (RF), eXtreme Gradient Boosting (XGBoost), Adaptive Boosting (AdaBoost), Support Vector Machine (SVM), K Nearest Neighbor (KNN), and Naïve Bayes (NB), based on six different sets of features. In each feature set, relevant features were extracted through a wrapper-based algorithm named Boruta. The results showed that, considering all features, the XGBoost model outperformed all other models, with the highest overall prediction accuracy (97%) and F1-score (95.5%). However, the highest overall prediction accuracy of 97.3% and F1-score of 95.9% were observed in the XGBoost model based on vehicle kinematics features. Moreover, it was found that XGBoost was the only model that achieved a reliable and balanced prediction performance across all six feature sets. Furthermore, a simplified XGBoost model was developed for each feature set considering the practical implementation of the model. The proposed prediction model could help in trajectory planning for AVs and could be used to develop more reliable advanced driver assistance systems (ADAS) in a cooperative connected and automated vehicle environment.
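
Overall accuracy and F1-score, the two metrics reported above, can diverge when one class is rare. The sketch below computes both from confusion-matrix counts. The counts are hypothetical and chosen only to illustrate the definitions, and `accuracy_and_f1` is an illustrative helper name.

```python
def accuracy_and_f1(tp, fp, fn, tn):
    """Return (accuracy, F1) for a binary classifier from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean
    return accuracy, f1


# Hypothetical counts: lane changes (positive class) are rare events.
acc, f1 = accuracy_and_f1(tp=40, fp=5, fn=10, tn=945)
print(f"accuracy = {acc:.3f}, F1 = {f1:.3f}")
```

With these counts the accuracy is dominated by the many correctly classified negative cases, while F1 reflects only how well the rare positive class is detected. This is a likely reason for reporting both numbers side by side.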
