scholarly journals Clustering Scatter Plots Using Data Depth Measures

2013 ◽  
Vol 04 (03) ◽  
Author(s):  
Zhanpan Zhang
Keyword(s):  
2012 ◽  
Vol 6 (2) ◽  
pp. 102-115
Author(s):  
Zhanpan Zhang ◽  
Xinping Cui ◽  
Daniel R. Jeske ◽  
James Borneman
Keyword(s):  

2011 ◽  
Vol 17 (1) ◽  
pp. 26-32
Author(s):  
Arla Westenskow ◽  
Patricia S. Moyer-Packenham

A family dog supplies the measurements for scatter plots and variables so that students can explore relationships among data sets—not to mention paws and tails.


2017 ◽  
Vol 19 (3) ◽  
pp. 489-497

For any river basin management, one needs tools to predict runoff at different time and spatial resolutions. Hydrological models are tools which account for the storage, flow of water and water balance in a watershed, which include exchanges of water and energy within the earth, atmosphere and oceans and utilise metrological data to generate flow. There are several sources of error in meteorological data, namely, through measurement at point level, interpolation, etc. When an erroneous input is passed to a model, one cannot expect an error free output from the prediction. Every prediction is associated with uncertainty. Quantification of these uncertainties is of prime importance in real world forecasting. In this study, an attempt has been made to study uncertainty associated with hydrological modelling, using the idea of data depth. To see the effect of uncertainty in rainfall on flow generation through a model, the input to a model was altered by adding an error and a different realisation was made. A Monte Carlo simulation generated a large number of hydrological model parameter sets drawn from the uniform distribution. The model was run using these parameters for each realisation of the rainfall. The parameters which are good for different realisations are more likely to be good parameters sets. For each parameter set, data depth was calculated and a likelihood was assigned to each parameter set based on the depth values. Based on this, the frequency distribution of the likelihood was analysed as well. The results show that uncertainty in hydrological modelling are multiplicative. The proposed methodology to assign prediction uncertainty is demonstrated using the ‘TopNet’ model for the Waipara river catchment located in the central east of the South Island, New Zealand. The results of this study will be helpful in calibration of hydrological model and in quantifying uncertainty in the prediction.


Author(s):  
Gary Smith

Back in the 1980s, I talked to an economics professor who made forecasts for a large bank based on simple correlations like the one in Figure 1. If he wanted to forecast consumer spending, he made a scatter plot of income and spending and used a transparent ruler to draw a line that seemed to fit the data. If the scatter looked like Figure 1, then when income went up, he predicted that spending would go up. The problem with his simple scatter plots is that the world is not simple. Income affects spending, but so does wealth. What if this professor happened to draw his scatter plot using data from a historical period in which income rose (increasing spending) but the stock market crashed (reducing spending) and the wealth effect was more powerful than the income effect, so that spending declined, as in Figure 2? The professor’s scatter plot of spending and income will indicate that an increase in income reduces spending. Then, when he tries to forecast spending for a period when income and wealth both increase, his prediction of a decline in spending will be disastrously wrong. Multiple regression to the rescue. Multiple regression models have multiple explanatory variables. For example, a model of consumer spending might be: C = a + bY + cW where C is consumer spending, Y is household income, and W is wealth. The order in which the explanatory variables are listed does not matter. What does matter is which variables are included in the model and which are left out. A large part of the art of regression analysis is choosing explanatory variables that are important and ignoring those that are unimportant. The coefficient b measures the effect on spending of an increase in income, holding wealth constant, and c measures the effect on spending of an increase in wealth, holding income constant. The math for estimating these coefficients is complicated but the principle is simple: choose the estimates that give the best predictions of consumer spending for the data used to estimate the model. In Chapter 4, we saw that spurious correlations can appear when we compare variables like spending, income, and wealth that all tend to increase over time.


Stat ◽  
2018 ◽  
Vol 7 (1) ◽  
pp. e174 ◽  
Author(s):  
Subhabrata Majumdar ◽  
Snigdhansu Chatterjee
Keyword(s):  

2011 ◽  
Vol 42 (5) ◽  
pp. 356-371 ◽  
Author(s):  
András Bárdossy ◽  
Shailesh Kumar Singh

The parameters of hydrological models with no or short discharge records can only be estimated using regional information. We can assume that catchments with similar characteristics show a similar hydrological behaviour. A regionalization of hydrological model parameters on the basis of catchment characteristics is therefore plausible. However, due to the non-uniqueness of the rainfall/runoff model parameters (equifinality), a procedure of a regional parameter estimation by model calibration and a subsequent fit of a regional function is not appropriate. In this paper, a different procedure based on the depth function and convex combinations of model parameters is introduced. Catchment characteristics to be used for regionalization can be identified by the same procedure. Regionalization is then performed using different approaches: multiple linear regression using the deepest parameter sets and convex combinations. The assessment of the quality of the regionalized models is also discussed. An example of 28 British catchments illustrates the methodology.


2007 ◽  
Vol 101 (5) ◽  
pp. 379-383
Author(s):  
Eric Milou ◽  
Jay Schiffman

In many mathematics classes, students are asked to learn via the discovery method, in the hope that the intrinsic beauty of mathematics becomes more accessible and that making conjectures, forming hypotheses, and analyzing patterns will help them compute fluently and solve problems creatively and resourcefully (NCTM 2000). The activity discussed in this article was conducted with a group of preservice and inservice teachers, and the objectives included examining patterns, making conjectures, and using data analysis to construct scatter plots and tables, all in the spirit of discovering mathematics. This activity is based on a concept called the multiplicative digital root of an integer (Sloane 1973). Here we take the term integers, unless otherwise qualified, to mean positive integers.


Sign in / Sign up

Export Citation Format

Share Document