scholarly journals Prediction of major international soccer tournaments based on team-specific regularized Poisson regression: An application to the FIFA World Cup 2014

Author(s):  
Andreas Groll ◽  
Gunther Schauberger ◽  
Gerhard Tutz

AbstractIn this article an approach for the analysis and prediction of international soccer match results is proposed. It is based on a regularized Poisson regression model that includes various potentially influential covariates describing the national teams’ success in previous FIFA World Cups. Additionally, within the generalized linear model (GLM) framework, also differences of team-specific effects are incorporated. In order to achieve variable selection and shrinkage, we use tailored Lasso approaches. Based on preceding FIFA World Cups, two models for the prediction of the FIFA World Cup 2014 are fitted and investigated. Based on the model estimates, the FIFA World Cup 2014 is simulated repeatedly and winning probabilities are obtained for all teams. Both models favor the actual FIFA World Champion Germany.

2021 ◽  
Vol 1 (1) ◽  
pp. 35-42
Author(s):  
Muhammad Bangkit Riksa Utama ◽  
Nusar Hajarisman

Abstract. In various experiments, data interactions take the form of discrete numbers or counts. The model that can be used for these data is the Poisson regression model. Poisson regression is included in the Generalized Linear Model (GLM). Poisson regression in general is very important in various fields and agreed to receive special attention. Often this model needs many independent variables. Then there needs to be a selection of poisson regression model variables. Due to the number of independent variables that exist, the selection of variables is carried out. Variable selection techniques that are commonly known are the forward, backward method, akaike information criteria and several other methods. In this paper, we will discuss one method of selecting variables in the Poisson regression model that has been made in the algorithm created by Famoye and Rothe. The algorithm created will be compared with the algorithm made by Nordberg. In this study data were used on Infant Mortality Rate (IMR) in West Java Province. Abstrak. Dalam berbagai eksperimen, seringkali data berupa bilangan diskrit atau cacah. Model yang dapat digunakan untuk data tersebut diantaranya adalah model regresi poisson. Regresi poisson termasuk kedalam Generalized Linear Model (GLM).  Regresi poisson secara umum sangat penting dalam berbagai bidang dan karenanya patut mendapat perhatian khusus. Seringkali model ini melibatkan banyak variabel independen. Maka perlu adanya cara untuk mempertimbangkan pemilihan variabel model regresi poisson. Dikarenakan banyaknya variabel independen yang ada maka  dilakukan penyeleksian variabel. Teknik pemilihan variabel yang sudah biasa dikenal yaitu metode forward, backward, akaike information criterion dan beberapa metode lainnya. Pada makalah ini akan dibahas mengenai salah satu metode pemilihan variabel dalam model regresi poisson yang telah dibentuk dalam algoritma yang dibuat oleh Famoye dan Rothe. Algortitma yang dibuat ini akan dibandingkan dengan algoritma yang telah dibuat oleh Nordberg. Pada penelitian ini  digunakan data mengenai Angka Kematian Bayi (AKB) di Provinsi Jawa Barat.


F1000Research ◽  
2021 ◽  
Vol 10 ◽  
pp. 548
Author(s):  
Benedicta B. Aladeitan ◽  
Olukayode Adebimpe ◽  
Adewale F. Lukman ◽  
Olajumoke Oludoun ◽  
Oluwakemi E. Abiodun

Background: Multicollinearity greatly affects the Maximum Likelihood Estimator (MLE) efficiency in both the linear regression model and the generalized linear model. Alternative estimators to the MLE include the ridge estimator, the Liu estimator and the Kibria-Lukman (KL) estimator, though literature shows that the KL estimator is preferred. Therefore, this study sought to modify the KL estimator to mitigate the Poisson Regression Model with multicollinearity. Methods: A simulation study and a real-life study were carried out and the performance of the new estimator was compared with some of the existing estimators. Results: The simulation result showed the new estimator performed more efficiently than the MLE, Poisson Ridge Regression Estimator (PRE), Poisson Liu Estimator (PLE) and the Poisson KL (PKL) estimators. The real-life application also agreed with the simulation result. Conclusions: In general, the new estimator performed more efficiently than the MLE, PRE, PLE and the PKL when multicollinearity was present.


F1000Research ◽  
2021 ◽  
Vol 10 ◽  
pp. 548
Author(s):  
Benedicta B. Aladeitan ◽  
Olukayode Adebimpe ◽  
Adewale F. Lukman ◽  
Olajumoke Oludoun ◽  
Oluwakemi E. Abiodun

Background: Multicollinearity greatly affects the Maximum Likelihood Estimator (MLE) efficiency in both the linear regression model and the generalized linear model. Alternative estimators to the MLE include the ridge estimator, the Liu estimator and the Kibria-Lukman (KL) estimator, though literature shows that the KL estimator is preferred. Therefore, this study sought to modify the KL estimator to mitigate the Poisson Regression Model with multicollinearity. Methods: A simulation study and a real-life study was carried out and the performance of the new estimator was compared with some of the existing estimators. Results: The simulation result showed the new estimator performed more efficiently than the MLE, Poisson Ridge Regression Estimator (PRE), Poisson Liu Estimator (PLE) and the Poisson KL (PKL) estimators. The real-life application also agreed with the simulation result. Conclusions: In general, the new estimator performed more efficiently than the MLE, PRE, PLE and the PKL when multicollinearity was present.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Huihui Zhang ◽  
Yini Liu ◽  
Fangyao Chen ◽  
Baibing Mi ◽  
Lingxia Zeng ◽  
...  

Abstract Background Since December 2019, the coronavirus disease 2019 (COVID-19) has spread quickly among the population and brought a severe global impact. However, considerable geographical disparities in the distribution of COVID-19 incidence existed among different cities. In this study, we aimed to explore the effect of sociodemographic factors on COVID-19 incidence of 342 cities in China from a geographic perspective. Methods Official surveillance data about the COVID-19 and sociodemographic information in China’s 342 cities were collected. Local geographically weighted Poisson regression (GWPR) model and traditional generalized linear models (GLM) Poisson regression model were compared for optimal analysis. Results Compared to that of the GLM Poisson regression model, a significantly lower corrected Akaike Information Criteria (AICc) was reported in the GWPR model (61953.0 in GLM vs. 43218.9 in GWPR). Spatial auto-correlation of residuals was not found in the GWPR model (global Moran’s I = − 0.005, p = 0.468), inferring the capture of the spatial auto-correlation by the GWPR model. Cities with a higher gross domestic product (GDP), limited health resources, and shorter distance to Wuhan, were at a higher risk for COVID-19. Furthermore, with the exception of some southeastern cities, as population density increased, the incidence of COVID-19 decreased. Conclusions There are potential effects of the sociodemographic factors on the COVID-19 incidence. Moreover, our findings and methodology could guide other countries by helping them understand the local transmission of COVID-19 and developing a tailored country-specific intervention strategy.


Author(s):  
J. M. Muñoz-Pichardo ◽  
R. Pino-Mejías ◽  
J. García-Heras ◽  
F. Ruiz-Muñoz ◽  
M. Luz González-Regalado

Author(s):  
Narges Motalebi ◽  
Mohammad Saleh Owlia ◽  
Amirhossein Amiri ◽  
Mohammad Saber Fallahnezhad

Author(s):  
Isabel Cardoso ◽  
Peder Frederiksen ◽  
Ina Olmer Specht ◽  
Mina Nicole Händel ◽  
Fanney Thorsteinsdottir ◽  
...  

This study reports age- and sex-specific incidence rates of juvenile idiopathic arthritis (JIA) in complete Danish birth cohorts from 1992 through 2002. Data were obtained from the Danish registries. All persons born in Denmark, from 1992–2002, were followed from birth and until either the date of first diagnosis recording, death, emigration, 16th birthday or administrative censoring (17 May 2017), whichever came first. The number of incident JIA cases and its incidence rate (per 100,000 person-years) were calculated within sex and age group for each of the birth cohorts. A multiplicative Poisson regression model was used to analyze the variation in the incidence rates by age and year of birth for boys and girls separately. The overall incidence of JIA was 24.1 (23.6–24.5) per 100,000 person-years. The rate per 100,000 person-years was higher among girls (29.9 (29.2–30.7)) than among boys (18.5 (18.0–19.1)). There were no evident peaks for any age group at diagnosis for boys but for girls two small peaks appeared at ages 0–5 years and 12–15 years. This study showed that the incidence rates of JIA in Denmark were higher for girls than for boys and remained stable over the observed period for both sexes.


Sign in / Sign up

Export Citation Format

Share Document