scholarly journals Benchmark products for land evapotranspiration: LandFlux-EVAL multi-data set synthesis

2013 ◽  
Vol 17 (10) ◽  
pp. 3707-3720 ◽  
Author(s):  
B. Mueller ◽  
M. Hirschi ◽  
C. Jimenez ◽  
P. Ciais ◽  
P. A. Dirmeyer ◽  
...  

Abstract. Land evapotranspiration (ET) estimates are available from several global data sets. Here, monthly global land ET synthesis products, merged from these individual data sets over the time periods 1989–1995 (7 yr) and 1989–2005 (17 yr), are presented. The merged synthesis products over the shorter period are based on a total of 40 distinct data sets while those over the longer period are based on a total of 14 data sets. In the individual data sets, ET is derived from satellite and/or in situ observations (diagnostic data sets) or calculated via land-surface models (LSMs) driven with observations-based forcing or output from atmospheric reanalyses. Statistics for four merged synthesis products are provided, one including all data sets and three including only data sets from one category each (diagnostic, LSMs, and reanalyses). The multi-annual variations of ET in the merged synthesis products display realistic responses. They are also consistent with previous findings of a global increase in ET between 1989 and 1997 (0.13 mm yr−2 in our merged product) followed by a significant decrease in this trend (−0.18 mm yr−2), although these trends are relatively small compared to the uncertainty of absolute ET values. The global mean ET from the merged synthesis products (based on all data sets) is 493 mm yr−1 (1.35 mm d−1) for both the 1989–1995 and 1989–2005 products, which is relatively low compared to previously published estimates. We estimate global runoff (precipitation minus ET) to 263 mm yr−1 (34 406 km3 yr−1) for a total land area of 130 922 000 km2. Precipitation, being an important driving factor and input to most simulated ET data sets, presents uncertainties between single data sets as large as those in the ET estimates. In order to reduce uncertainties in current ET products, improving the accuracy of the input variables, especially precipitation, as well as the parameterizations of ET, are crucial.

2013 ◽  
Vol 10 (1) ◽  
pp. 769-805 ◽  
Author(s):  
B. Mueller ◽  
M. Hirschi ◽  
C. Jimenez ◽  
P. Ciais ◽  
P. A. Dirmeyer ◽  
...  

Abstract. Land evapotranspiration (ET) estimates are available from several global datasets. Here, monthly global land ET synthesis products, merged from these individual datasets over the time periods 1989–1995 (7 yr) and 1989–2005 (17 yr), are presented. The merged synthesis products over the shorter period are based on a total of 40 distinct datasets while those over the longer period are based on a total of 14 datasets. In the individual datasets, ET is derived from satellite and/or in-situ observations (diagnostic datasets) or calculated via land-surface models (LSMs) driven with observations-based forcing and atmospheric reanalyses. Statistics for four merged synthesis products are provided, one including all datasets and three including only datasets from one category each (diagnostic, LSMs, and reanalyses). The multi-annual variations of ET in the merged synthesis products display realistic responses. They are also consistent with previous findings of a global increase in ET between 1989 and 1997 (1.15 mm yr−2 in our merged product) followed by a decrease in this trend (−1.40 mm yr−2), although these trends are relatively small compared to the uncertainty of absolute ET values. The global mean ET from the merged synthesis products (based on all datasets) is 1.35 mm per day for both the 1989–1995 and 1989–2005 products, which is relatively low compared to previously published estimates. We estimate global runoff (precipitation minus ET) to 34 406 km3 per year for a total land area of 130 922 km2. Precipitation, being an important driving factor and input to most simulated ET datasets, presents uncertainties between single datasets as large as those in the ET estimates. In order to reduce uncertainties in current ET products, improving the accuracy of the input variables, especially precipitation, as well as the parameterizations of ET are crucial.


2004 ◽  
Vol 13 (04) ◽  
pp. 863-880 ◽  
Author(s):  
VLADIMIR FILKOV ◽  
STEVEN SKIENA

With the exploding volume of microarray experiments comes increasing interest in mining repositories of such data. Meaningfully combining results from varied experiments on an equal basis is a challenging task. Here we propose a general method for integrating heterogeneous data sets based on the consensus clustering formalism. Our method analyzes source-specific clusterings and identifies a consensus set-partition which is as close as possible to all of them. We develop a general criterion to assess the potential benefit of integrating multiple heterogeneous data sets, i.e. whether the integrated data is more informative than the individual data sets. We apply our methods on two popular sets of microarray data yielding gene classifications of potentially greater interest than could be derived from the analysis of each individual data set.


2020 ◽  
Vol 122 (11) ◽  
pp. 1-32
Author(s):  
Michael A. Gottfried ◽  
Vi-Nhuan Le ◽  
J. Jacob Kirksey

Background It is of grave concern that kindergartners are missing more school than students in any other year of elementary school; therefore, documenting which students are absent and for how long is of upmost importance. Yet, doing so for students with disabilities (SWDs) has received little attention. This study addresses this gap by examining two cohorts of SWDs, separated by more than a decade, to document changes in attendance patterns. Research Questions First, for SWDs, has the number of school days missed or chronic absenteeism rates changed over time? Second, how are changes in the number of school days missed and chronic absenteeism rates related to changes in academic emphasis, presence of teacher aides, SWD-specific teacher training, and preschool participation? Subjects This study uses data from the Early Childhood Longitudinal Study (ECLS), a nationally representative data set of children in kindergarten. We rely on both ECLS data sets— the kindergarten classes of 1998–1999 and 2010–2011. Measures were identical in both data sets, making it feasible to compare children across the two cohorts. Given identical measures, we combined the data sets into a single data set with an indicator for being in the older cohort. Research Design This study examined two sets of outcomes: The first was number of days absent, and the second was likelihood of being chronically absent. These outcomes were regressed on a measure for being in the older cohort (our key measure for changes over time) and numerous control variables. The error term was clustered by classroom. Findings We found that SWDs are absent more often now than they were a decade earlier, and this growth in absenteeism was larger than what students without disabilities experienced. Absenteeism among SWDs was higher for those enrolled in full-day kindergarten, although having attended center-based care mitigates this disparity over time. Implications are discussed. Conclusions Our study calls for additional attention and supports to combat the increasing rates of absenteeism for SWDs over time. Understanding contextual shifts and trends in rates of absenteeism for SWDs in kindergarten is pertinent to crafting effective interventions and research geared toward supporting the academic and social needs of these students.


2018 ◽  
Vol 22 (12) ◽  
pp. 6241-6255 ◽  
Author(s):  
Soumendra N. Bhanja ◽  
Xiaokun Zhang ◽  
Junye Wang

Abstract. Groundwater is one of the most important natural resources for economic development and environmental sustainability. In this study, we estimated groundwater storage in 11 major river basins across Alberta, Canada, using a combination of remote sensing (Gravity Recovery and Climate Experiment, GRACE), in situ surface water data, and land surface modeling estimates (GWSAsat). We applied separate calculations for unconfined and confined aquifers, for the first time, to represent their hydrogeological differences. Storage coefficients for the individual wells were incorporated to compute the monthly in situ groundwater storage (GWSAobs). The GWSAsat values from the two satellite-based products were compared with GWSAobs estimates. The estimates of GWSAsat were in good agreement with the GWSAobs in terms of pattern and magnitude (e.g., RMSE ranged from 2 to 14 cm). While comparing GWSAsat with GWSAobs, most of the statistical analyses provide mixed responses; however the Hodrick–Prescott trend analysis clearly showed a better performance of the GRACE-mascon estimate. The results showed trends of GWSAobs depletion in 5 of the 11 basins. Our results indicate that precipitation played an important role in influencing the GWSAobs variation in 4 of the 11 basins studied. A combination of rainfall and snowmelt positively influences the GWSAobs in six basins. Water budget analysis showed an availability of comparatively lower terrestrial water in 9 of the 11 basins in the study period. Historical groundwater recharge estimates indicate a reduction of groundwater recharge in eight basins during 1960–2009. The output of this study could be used to develop sustainable water withdrawal strategies in Alberta, Canada.


2021 ◽  
Author(s):  
Oluwakemi Dare-Idowu ◽  
Lionel Jarlan ◽  
Aurore Brut ◽  
Valerie Le-Dantec ◽  
Vincent Rivalland ◽  
...  

<p>This study aims to analyze the main components of the energy and hydric budgets of irrigated maize in southwestern France. To this objective, the ISBA-A-gs (<span>Interactions between Soil, Biosphere, and Atmosphere) </span>is run over six maize growing seasons. As a preliminary step, the ability of the ISBA-A-gs model to predict the different terms of the energy and water budgets is assessed thanks to a large database of <em>in situ</em> measurements by comparing the single budget version of the model with the new Multiple Energy Balance version solving an energy budget separately for the soil and the vegetation. The <em>in situ</em> data set acquired at the Lamasquere site (43.48<sup>o</sup> N, 1.249<sup>o</sup> E) includes half-hourly measurements of sensible (H) and latent heat fluxes (LE) estimated by an Eddy Covariance system. Measurements also include net radiation (Rn), ground heat flux (G), plant transpiration with sap flow sensors, meteorological variables, and 15-days measurements of vegetation characteristics. The seasonal dynamics of the turbulent fluxes were properly reproduced by both configurations of the model with an R² ranging from 0.66 to 0.89, and a root mean square error lower than 48 W m<sup>-2</sup>. Statistical metrics showed that H was better predicted by MEB with R² of 0.80 in comparison to ISBA-Ags (0.73). However, the difference between the RMSE of ISBA-Ags and MEB during the well-developed stage of the plants for both H and LE does not exceed 8 W m<sup>-2</sup>. This implies that MEB only has a significant added value over ISBA-Ags when the soil and the canopy are not fully coupled, and over a heterogeneous field. Furthermore, this study made a comparison between the sap flow measurements and the transpiration simulated by ISBA-A-gs and MEB. A good dynamics was reproduced by ISBA-A-gs and MEB, although, MEB (R²= 0.91) provided a slightly more realistic estimation of the vegetation transpiration. Consequently, this study investigated the dynamics of the water budget during the growing maize seasons. Results indicated that drainage is almost null on the site, while the observed values of cumulative evapotranspiration that was higher than the water inputs are related to a shallow ground table that provides supplement water to the crop. This work provides insight into the modeling of water and energy exchanges over maize crops and opens perspectives for better water management of the crop in the future.</p>


Author(s):  
Li Yang ◽  
Qi Wang ◽  
Yu Rao

Abstract Film Cooling is an important and widely used technology to protect hot sections of gas turbines. The last decades witnessed a fast growth of research and publications in the field of film cooling. However, except for the correlations for single row film cooling and the Seller correlation for cooling superposition, there were rarely generalized models for film cooling under superposition conditions. Meanwhile, the numerous data obtained for complex hole distributions were not emerged or integrated from different sources, and recent new data had no avenue to contribute to a compatible model. The technical barriers that obstructed the generalization of film cooling models are: a) the lack of a generalizable model; b) the large number of input variables to describe film cooling. The present study aimed at establishing a generalizable model to describe multiple row film cooling under a large parameter space, including hole locations, hole size, hole angles, blowing ratios etc. The method allowed data measured within different streamwise lengths and different surface areas to be integrated in a single model, in the form 1-D sequences. A Long Short Term Memory model was designed to model the local behavior of film cooling. Careful training, testing and validation were conducted to regress the model. The presented results showed that the method was accurate within the CFD data set generated in this study. The presented method could serve as a base model that allowed past and future film cooling research to contribute to a common data base. Meanwhile, the model could also be transferred from simulation data sets to experimental data sets using advanced machine learning algorithms in the future.


2019 ◽  
Vol 622 ◽  
pp. A172 ◽  
Author(s):  
F. Murgas ◽  
G. Chen ◽  
E. Pallé ◽  
L. Nortmann ◽  
G. Nowak

Context. Rayleigh scattering in a hydrogen-dominated exoplanet atmosphere can be detected using ground- or space-based telescopes. However, stellar activity in the form of spots can mimic Rayleigh scattering in the observed transmission spectrum. Quantifying this phenomena is key to our correct interpretation of exoplanet atmospheric properties. Aims. We use the ten-meter Gran Telescopio Canarias (GTC) telescope to carry out a ground-based transmission spectra survey of extrasolar planets to characterize their atmospheres. In this paper we investigate the exoplanet HAT-P-11b, a Neptune-sized planet orbiting an active K-type star. Methods. We obtained long-slit optical spectroscopy of two transits of HAT-P-11b with the Optical System for Imaging and low-Intermediate-Resolution Integrated Spectroscopy (OSIRIS) on August 30, 2016 and September 25, 2017. We integrated the spectrum of HAT-P-11 and one reference star in several spectroscopic channels across the λ ~ 400–785 nm region, creating numerous light curves of the transits. We fit analytic transit curves to the data taking into account the systematic effects and red noise present in the time series in an effort to measure the change of the planet-to-star radius ratio (Rp∕Rs) across wavelength. Results. By fitting both transits together, we find a slope in the transmission spectrum showing an increase of the planetary radius towards blue wavelengths. Closer inspection of the transmission spectrum of the individual data sets reveals that the first transit presents this slope while the transmission spectrum of the second data set is flat. Additionally, we detect hints of Na absorption on the first night, but not on the second. We conclude that the transmission spectrum slope and Na absorption excess found in the first transit observation are caused by unocculted stellar spots. Modeling the contribution of unocculted spots to reproduce the results of the first night we find a spot filling factor of δ = 0.62−0.17+0.20 and a spot-to-photosphere temperature difference of ΔT = 429−299+184 K.


2019 ◽  
Vol 11 (23) ◽  
pp. 2736 ◽  
Author(s):  
Jueying Bai ◽  
Qian Cui ◽  
Wen Zhang ◽  
Lingkui Meng

A method is proposed for the production of downscaled soil moisture active passive (SMAP) soil moisture (SM) data by combining optical/infrared data with synthetic aperture radar (SAR) data based on the random forest (RF) model. The method leverages the sensitivity of active microwaves to surface SM and the triangle/trapezium feature space among vegetation indexes (VIs), land surface temperature (LST), and SM. First, five RF architectures (RF1–RF5) were trained and tested at 9 km. Second, a comparison was performed for RF1–RF5, and were evaluated against in situ SM measurements. Third, two SMAP-Sentinel active–passive SM products were compared at 3 km and 1 km using in situ SM measurements. Fourth, the RF5 model simulations were compared with the SMAP L2_SM_SP product based on the optional algorithm at 3 km and 1 km resolutions. The results showed that the downscaled SM based on the synergistic use of optical/infrared data and the backscatter at vertical–vertical (VV) polarization was feasible in semi-arid areas with relatively low vegetation cover. The RF5 model with backscatter and more parameters from optical/infrared data performed best among the five RF models and was satisfactory at both 3 km and 1 km. Compared with L2_SM_SP, RF5 was more superior at 1 km. The input variables in decreasing order of importance were backscatter, LST, VIs, and topographic factors over the entire study area. The low vegetation cover conditions probably amplified the importance of the backscatter and LST. A sufficient number of VIs can enhance the adaptability of RF models to different vegetation conditions.


2014 ◽  
Vol 31 (8) ◽  
pp. 1778-1789
Author(s):  
Hongkang Lin

Purpose – The clustering/classification method proposed in this study, designated as the PFV-index method, provides the means to solve the following problems for a data set characterized by imprecision and uncertainty: first, discretizing the continuous values of all the individual attributes within a data set; second, evaluating the optimality of the discretization results; third, determining the optimal number of clusters per attribute; and fourth, improving the classification accuracy (CA) of data sets characterized by uncertainty. The paper aims to discuss these issues. Design/methodology/approach – The proposed method for the solution of the clustering/classifying problem, designated as PFV-index method, combines a particle swarm optimization algorithm, fuzzy C-means method, variable precision rough sets theory, and a new cluster validity index function. Findings – This method could cluster the values of the individual attributes within the data set and achieves both the optimal number of clusters and the optimal CA. Originality/value – The validity of the proposed approach is investigated by comparing the classification results obtained for UCI data sets with those obtained by supervised classification BPNN, decision-tree methods.


Ocean Science ◽  
2019 ◽  
Vol 15 (2) ◽  
pp. 249-268 ◽  
Author(s):  
Johannes Schulz-Stellenfleth ◽  
Joanna Staneva

Abstract. In many coastal areas there is an increasing number and variety of observation data available, which are often very heterogeneous in their temporal and spatial sampling characteristics. With the advent of new systems, like the radar altimeter on board the Sentinel-3A satellite, a lot of questions arise concerning the accuracy and added value of different instruments and numerical models. Quantification of errors is a key factor for applications, like data assimilation and forecast improvement. In the past, the triple collocation method to estimate systematic and stochastic errors of measurements and numerical models was successfully applied to different data sets. This method relies on the assumption that three independent data sets provide estimates of the same quantity. In coastal areas with strong gradients even small distances between measurements can lead to larger differences and this assumption can become critical. In this study the triple collocation method is extended in different ways with the specific problems of the coast in mind. In addition to nearest-neighbour approximations considered so far, the presented method allows for use of a large variety of interpolation approaches to take spatial variations in the observed area into account. Observation and numerical model errors can therefore be estimated, even if the distance between the different data sources is too large to assume that they measure the same quantity. If the number of observations is sufficient, the method can also be used to estimate error correlations between certain data source components. As a second novelty, an estimator for the uncertainty in the derived observation errors is derived as a function of the covariance matrices of the input data and the number of available samples. In the first step, the method is assessed using synthetic observations and Monte Carlo simulations. The technique is then applied to a data set of Sentinel-3A altimeter measurements, in situ wave observations, and numerical wave model data with a focus on the North Sea. Stochastic observation errors for the significant wave height, as well as bias and calibration errors, are derived for the model and the altimeter. The analysis indicates a slight overestimation of altimeter wave heights, which become more pronounced at higher sea states. The smallest stochastic errors are found for the in situ measurements. Different observation geometries of in situ data and altimeter tracks are furthermore analysed, considering 1-D and 2-D interpolation approaches. For example, the geometry of an altimeter track passing between two in situ wave instruments is considered with model data being available at the in situ locations. It is shown that for a sufficiently large sample, the errors of all data sources, as well as the error correlations of the model, can be estimated with the new method.


Sign in / Sign up

Export Citation Format

Share Document