global regression
Recently Published Documents


TOTAL DOCUMENTS

64
(FIVE YEARS 30)

H-INDEX

13
(FIVE YEARS 3)

2022 ◽  
Author(s):  
Adam Slez

While quantitative methods are routinely used to examine historical materials, critics take issue with the use of global regression models that attach a single parameter to each predictor, thereby ignoring the effects of time and space, which together define the context in which historical events unfold. This problem can be addressed by allowing for parameter heterogeneity, as highlighted by the proliferation of work on the use of time-varying parameter models. In this paper, I show how this approach can be extended to the case of spatial data using spatially-varying coefficient models, with an eye toward the study of electoral politics, where the use of spatial data is especially common in historical settings. Toward this end, I revisit a critical case in the field of quantitative history: the rise of electoral Populism in the American West in the period between 1890 and 1896. Upending popular narratives about the correlates of third- party support in the late nineteenth century, I show that the association between third- party vote share and traditional predictors such as economic hardship and ethnic composition varied considerably from one place to the next, giving rise to distinct varieties of electoral Populism—a finding that is missed by global models, which mistake the mathematically particular for the historically general. These findings have important theoretical and empirical implications for the study of political action in a world where parameter heterogeneity is increasingly recognized as a standard feature of modern social science.


2022 ◽  
Vol 11 (1) ◽  
pp. 42
Author(s):  
Mingyang Du ◽  
Xuefeng Li ◽  
Mei-Po Kwan ◽  
Jingzong Yang ◽  
Qiyang Liu

Understanding the spatiotemporal variation of high-efficiency ride-hailing orders (HROs) is helpful for transportation network companies (TNCs) to balance the income of drivers through reasonable order dispatch, and to alleviate the imbalance between supply and demand by improving the pricing mechanism, so as to promote the sustainable and healthy development of the ride-hailing industry and urban transportation. From the perspective of TNCs for order management, this study investigates the spatiotemporal variation of HROs and common ride-hailing orders (CROs) for ride-hailing services using the trip data of Didi Chuxing in Haikou, China. Ordinary least squares (OLS) and geographically weighted regression (GWR) models are established to examine the factors that affect the densities of HROs and CROs during different time periods, such as morning, evening, afternoon and night, with considering various built environment variables. The OLS models show that factors including road density, average travel time rate, companies and enterprises and transportation facilities have significant impacts on HROs and CROs for most periods. The results of the GWR models are consistent with the global regression results and show the local effects of the built environment on HROs and CROs in different regions.


Land ◽  
2021 ◽  
Vol 10 (11) ◽  
pp. 1226
Author(s):  
Shanaka Herath

Estimating the non-market monetary values of urban amenities has become commonplace in urban planning research, particularly following Rosen’s seminal article on hedonic theory in 1974. As a revealed preference method, the hedonic approach decouples the market price of a house into price components that are attributable to housing characteristics. Despite the potential contribution of this theory in a planning context, three main limitations exist in the conventional applications: (1) variable measurement issues, (2) model misspecification, and (3) the problematic common use of global regression. These flaws problematically skew our understanding of the urban structure and spatial distribution of amenities, leading to misinformed policy interventions and poor amenity planning decisions. In this article, we propose a coherent conceptual framework that addresses measurement, specification, and scale challenges to generate consistent economic estimates of local amenities. Finally, we argue that, by paying greater attention to the spatial equity of amenity values, governments can provide greater equality of opportunities in cities.


2021 ◽  
Vol 15 (4) ◽  
pp. 117-127
Author(s):  
Zubairul Islam ◽  
Sudhir Kumar Singh

The main objective was to explore the connection between flood and drought hazards and their impact on crop land and human migration. The Flood and Drought effect on Cropland Index (FDCI), hot spot analysis and the Global Regression Analysis method was applied for the identification of the relationship between human migration and flood and drought hazards. The spatial pattern and hot and cold spots of FDCI, spatial autocorrelation and Getis-OrdGi* statistic techniques were used respectively. The FDCI was taken as an explanatory variable and human migration was taken as a dependent variable in the environment of the geographically weighted regression (GWR) model which was applied to measure the impact of flood and drought hazards on human migration. FDCI suggests a z-score of 4.9, which shows that the impact of flood and drought frequency on crop land is highly clustered. In the case of the hot spots analysis, out of seventy districts in Uttar Pradesh twenty-one were classified as hot spot and eight were classified as cold spots with a confidence level of 90 to 99%. Hot spot indicate maximum and cold spots show minimum impact of flood and drought hazards on crop land. The impact of flood and drought hazards on human migration show that there are fourteen districts where migration out is far more than predicted while there are ten districts where migration out is far lower.


2021 ◽  
Vol 11 (20) ◽  
pp. 9424
Author(s):  
Guanwei Zhao ◽  
Zhitao Li ◽  
Muzhuang Yang

The spatial decomposition of demographic data at a fine resolution is a classic and crucial problem in the field of geographical information science. The main objective of this study was to compare twelve well-known machine learning regression algorithms for the spatial decomposition of demographic data with multisource geospatial data. Grid search and cross-validation methods were used to ensure that the optimal model parameters were obtained. The results showed that all the global regression algorithms used in the study exhibited acceptable results, besides the ordinary least squares (OLS) algorithm. In addition, the regularization method and the subsetting method were both useful for alleviating overfitting in the OLS model, and the former was better than the latter. The more competitive performance of the nonlinear regression algorithms than the linear regression algorithms implies that the relationship between population density and influence factors is likely to be non-linear. Among the global regression algorithms used in the study, the best results were achieved by the k-nearest neighbors (KNN) regression algorithm. In addition, it was found that multi-sources geospatial data can improve the accuracy of spatial decomposition results significantly, and thus the proposed method in our study can be applied to the study of spatial decomposition in other areas.


2021 ◽  
Vol 13 (20) ◽  
pp. 4046
Author(s):  
Victor Pryamitsyn ◽  
Boris Petrenko ◽  
Alexander Ignatov ◽  
Yury Kihai

The first full-mission global AVHRR FRAC sea surface temperature (SST) dataset with a nominal 1.1km resolution at nadir was produced from three Metop First Generation (FG) satellites: Metop-A (2006-on), -B (2012-on) and -C (2018-on), using the NOAA Advanced Clear Sky Processor for Ocean (ACSPO) SST enterprise system. Historical reprocessing (‘Reanalysis-1’, RAN1) starts at the beginning of each mission and continues into near-real time (NRT). ACSPO generates two SST products, one with global regression (GR; highly sensitive to skin SST), and another one with piecewise regression (PWR; proxy for depth SST) algorithms. Small residual effects of orbital and sensor instabilities on SST retrievals are mitigated by retraining the regression coefficients daily, using matchups with drifting and tropical moored buoys within moving time windows. In RAN, the training windows are centered at the processed day. In NRT, the same size windows are employed but delayed in time, ending four to ten days prior to the processed day. Delayed-mode RAN reprocessing follows the NRT with a two-month lag, resulting in a higher quality and a more consistent SST record. In addition to its completeness, the newly created Metop-FG RAN1 SST dataset shows very close agreement with in situ data (including the fully independent Argo floats), well within the NOAA specifications for accuracy (global mean bias; ±0.2 K) and precision (global standard deviation; 0.6 K) in a ~20% clear-sky domain (percent of clear-sky SST pixels to the total of ice-free ocean). All performance statistics are stable in time, and consistent across the three platforms. The Metop-FG RAN1 data set is archived at the NASA JPL PO.DAAC and NOAA NCEI. This paper documents the newly created dataset and evaluates its performance.


2021 ◽  
Author(s):  
Aliaksei Mazheika ◽  
Yanggang Wang ◽  
Rosendo Valero ◽  
Francesc Vines ◽  
Francesc Illas ◽  
...  

Abstract Using subgroup discovery, an artificial intelligence (AI) approach that identifies statistically exceptional subgroups in a dataset, we develop a strategy for a rational design of catalytic materials. We identify “materials genes” (features of catalyst materials) that correlate with mechanisms that trigger, facilitate, or hinder the activation of carbon dioxide (CO2) towards a chemical conversion. The approach is used to address the conversion of CO2 to fuels and other useful chemicals. The AI model is trained on high-throughput first-principles data for a broad family of oxides. We demonstrate that bending of the gas-phase linear molecule, previously proposed as the indicator of activation, is insufficient to account for the good catalytic performance of experimentally characterized oxide surfaces. Instead, our AI approach identifies the common feature of these surfaces in the binding of a molecular O atom to a surface cation, which results in a strong elongation and therefore weakening of one molecular C-O bond. The same conclusion is obtained by using the bending indicator only when incombination with the Sabatier principle. Based on these findings, we propose a set of new promising oxide-based catalyst materials for CO2 conversion, and a recipe to find more. Our analysis also reveals advantages of local pattern discovery methods such as subgroup discovery over standard global regression approaches in discovering combinations of materials properties that result in a catalytic activation.


Energies ◽  
2021 ◽  
Vol 14 (13) ◽  
pp. 3965
Author(s):  
Daniel Chuquin-Vasco ◽  
Francis Parra ◽  
Nelson Chuquin-Vasco ◽  
Juan Chuquin-Vasco ◽  
Vanesa Lo-Iacono-Ferreira

The objective of this research was to design a neural network (ANN) to predict the methanol flux at the outlet of a carbon dioxide dehydrogenation plant. For the development of the ANN, a database was generated, in the open-source simulation software “DWSIM”, from the validation of a process described in the literature. The sample consists of 133 data pairs with four inputs: reactor pressure and temperature, mass flow of carbon dioxide and hydrogen, and one output: flow of methanol. The ANN was designed using 12 neurons in the hidden layer and it was trained with the Levenberg–Marquardt algorithm. In the training, validation and testing phase, a global mean square (RMSE) value of 0.0085 and a global regression coefficient R of 0.9442 were obtained. The network was validated through an analysis of variance (ANOVA), where the p-value for all cases was greater than 0.05, which indicates that there are no significant differences between the observations and those predicted by the ANN. Therefore, the designed ANN can be used to predict the methanol flow at the exit of a dehydrogenation plant and later for the optimization of the system.


Atmosphere ◽  
2021 ◽  
Vol 12 (5) ◽  
pp. 593
Author(s):  
Javed Mallick ◽  
Mohammed K. AlMesfer ◽  
Vijay P. Singh ◽  
Ibrahim I. Falqi ◽  
Chander Kumar Singh ◽  
...  

The Normalized Difference Vegetation Index (NDVI) and rainfall data were used to model the spatial relationship between vegetation and rainfall. Their correlation in previous studies was typically based on a global regression model, which assumed that the correlation was constant across space. The NDVI–rainfall association, on the other hand, is spatially non-stationary, non-linear, scale-dependent, and influenced by local factors (e.g., soil background). In this study, two statistical methods are used in the modeling, i.e., traditional ordinary least squares (OLS) regression and geographically weighted regression (GWR), to evaluate the NDVI–rainfall relationship. The GWR was implemented annually in the growing seasons of 2000 and 2016, using climate data (Normalized Vegetation Difference Index and rainfall). The NDVI–rainfall relationship in the studied Bisha watershed (an eco-sensitive zone with a complex landscape) was found to have a stable operating scale of around 12 km. The findings support the hypothesis that the OLS model’s average impression could not accurately represent local conditions. By addressing spatial non-stationarity, the GWR approach greatly improves the model’s accuracy and predictive ability. In analyzing the relationship between NDVI patterns and rainfall, our research has shown that GWR outperforms a global OLS model. This superiority stems primarily from the consideration of the relationship’s spatial variance across the study area. Global regression techniques such as OLS can overlook local details, implying that a large portion of the variance in NDVI is unexplained. It appears that rainfall is the most significant factor in deciding the distribution of vegetation in these regions. Furthermore, rainfall had weak relationships with areas predominantly located around wetlands, suggesting the need for additional factors to describe NDVI variations. The GWR method performed better in terms of accuracy, predictive power, and reduced residual autocorrelation. Thus, GWR is recommended as an explanatory and exploratory technique when relations between variables are subject to spatial variability. Since the GWR is a local form of spatial analysis that aligned to local conditions, it has the potential for more accurate prediction; however, a larger amount of data is needed to allow a reliable local fitting.


Sign in / Sign up

Export Citation Format

Share Document