Comparative Analysis of Empirical Bayes and Bayesian Hierarchical Models in Hotspot Identification

Author(s):  
Xiaoyu Guo ◽  
Lingtao Wu ◽  
Yajie Zou ◽  
Lee Fawcett

Hotspot identification is an important step in the highway safety management process. Errors in hotspot identification (HSID) may result in an inefficient use of limited resources for safety improvements. The empirical Bayesian (EB) HSID has been widely applied as an effective approach in identifying hotspots. However, there are some limitations with the EB approach. It assumes that the parameter estimates of the safety performance function (SPF) are correct without any uncertainty, and does not consider temporal instability in crashes, which has been reported in recent studies. The Bayesian hierarchical model is an emerging technique that addresses the limitations of the EB method. Thus, the objective of this study is to compare the performance of the standard EB method and the Bayesian hierarchical model in identifying hotspots. Three methods (crash rate, EB, and the Bayesian hierarchical model) were applied to identify risky intersections with different significance levels. Four evaluation tests (site consistency, method consistency, total rank differences, and Poisson mean differences tests) were conducted to assess the performance of these three methods. The testing results suggest that: (1) the Bayesian hierarchical model outperforms the crash rate and the EB methods in most cases, and the Bayesian hierarchical model improves the accuracy of HSID significantly; and (2) hotspots identified with crash rates are generally unreliable. This is significant for roadway agencies and practitioners trying to accurately rank sites in the roadway network to effectively manage safety investments. Roadway agencies and practitioners are encouraged to consider the Bayesian hierarchical model in identifying hotspots.

2019 ◽  
Vol 98 (4) ◽  
pp. 1601-1609 ◽  
Author(s):  
Ariane Gonçalves Gotuzzo ◽  
Miriam Piles ◽  
Raquel Pillon Della-Flora ◽  
Jerusa Martins Germano ◽  
Janaina Scaglioni Reis ◽  
...  

2020 ◽  
Vol 16 (4) ◽  
pp. 271-289
Author(s):  
Nathan Sandholtz ◽  
Jacob Mortensen ◽  
Luke Bornn

AbstractEvery shot in basketball has an opportunity cost; one player’s shot eliminates all potential opportunities from their teammates for that play. For this reason, player-shot efficiency should ultimately be considered relative to the lineup. This aspect of efficiency—the optimal way to allocate shots within a lineup—is the focus of our paper. Allocative efficiency should be considered in a spatial context since the distribution of shot attempts within a lineup is highly dependent on court location. We propose a new metric for spatial allocative efficiency by comparing a player’s field goal percentage (FG%) to their field goal attempt (FGA) rate in context of both their four teammates on the court and the spatial distribution of their shots. Leveraging publicly available data provided by the National Basketball Association (NBA), we estimate player FG% at every location in the offensive half court using a Bayesian hierarchical model. Then, by ordering a lineup’s estimated FG%s and pairing these rankings with the lineup’s empirical FGA rate rankings, we detect areas where the lineup exhibits inefficient shot allocation. Lastly, we analyze the impact that sub-optimal shot allocation has on a team’s overall offensive potential, demonstrating that inefficient shot allocation correlates with reduced scoring.


2019 ◽  
Vol 15 (4) ◽  
pp. 313-325 ◽  
Author(s):  
Martin Ingram

Abstract A well-established assumption in tennis is that point outcomes on each player’s serve in a match are independent and identically distributed (iid). With this assumption, it is enough to specify the serve probabilities for both players to derive a wide variety of event distributions, such as the expected winner and number of sets, and number of games. However, models using this assumption, which we will refer to as “point-based”, have typically performed worse than other models in the literature at predicting the match winner. This paper presents a point-based Bayesian hierarchical model for predicting the outcome of tennis matches. The model predicts the probability of winning a point on serve given surface, tournament and match date. Each player is given a serve and return skill which is assumed to follow a Gaussian random walk over time. In addition, each player’s skill varies by surface, and tournaments are given tournament-specific intercepts. When evaluated on the ATP’s 2014 season, the model outperforms other point-based models, predicting match outcomes with greater accuracy (68.8% vs. 66.3%) and lower log loss (0.592 vs. 0.641). The results are competitive with approaches modelling the match outcome directly, demonstrating the forecasting potential of the point-based modelling approach.


Sign in / Sign up

Export Citation Format

Share Document