sem algorithm
Recently Published Documents


TOTAL DOCUMENTS

26
(FIVE YEARS 8)

H-INDEX

5
(FIVE YEARS 0)

Author(s):  
Kerstin Erfurth ◽  
Marcus Groß ◽  
Ulrich Rendtel ◽  
Timo Schmid

AbstractComposite spatial data on administrative area level are often presented by maps. The aim is to detect regional differences in the concentration of subpopulations, like elderly persons, ethnic minorities, low-educated persons, voters of a political party or persons with a certain disease. Thematic collections of such maps are presented in different atlases. The standard presentation is by Choropleth maps where each administrative unit is represented by a single value. These maps can be criticized under three aspects: the implicit assumption of a uniform distribution within the area, the instability of the resulting map with respect to a change of the reference area and the discontinuities of the maps at the borderlines of the reference areas which inhibit the detection of regional clusters.In order to address these problems we use a density approach in the construction of maps. This approach does not enforce a local uniform distribution. It does not depend on a specific choice of area reference system and there are no discontinuities in the displayed maps. A standard estimation procedure of densities are Kernel density estimates. However, these estimates need the geo-coordinates of the single units which are not at disposal as we have only access to the aggregates of some area system. To overcome this hurdle, we use a statistical simulation concept. This can be interpreted as a Simulated Expectation Maximisation (SEM) algorithm of Celeux et al (1996). We simulate observations from the current density estimates which are consistent with the aggregation information (S-step). Then we apply the Kernel density estimator to the simulated sample which gives the next density estimate (E-Step).This concept has been first applied for grid data with rectangular areas, see Groß et al (2017), for the display of ethnic minorities. In a second application we demonstrated the use of this approach for the so-called “change of support” (Bradley et al 2016) problem. Here Groß et al (2020) used the SEM algorithm to recalculate case numbers between non-hierarchical administrative area systems. Recently Rendtel et al (2021) applied the SEM algorithm to display spatial-temporal clusters of Corona infections in Germany.Here we present three modifications of the basic SEM algorithm: 1) We introduce a boundary correction which removes the underestimation of kernel density estimates at the borders of the population area. 2) We recognize unsettled areas, like lakes, parks and industrial areas, in the computation of the kernel density. 3) We adapt the SEM algorithm for the computation of local percentages which are important especially in voting analysis.We evaluate our approach against several standard maps by means of the local voting register with known addresses. In the empirical part we apply our approach for the display of voting results for the 2016 election of the Berlin parliament. We contrast our results against Choropleth maps and show new possibilities for reporting spatial voting results.


Mathematics ◽  
2021 ◽  
Vol 9 (21) ◽  
pp. 2834
Author(s):  
José Antonio Roldán-Nofuentes ◽  
Saad Bouh Regad

The average kappa coefficient of a binary diagnostic test is a parameter that measures the average beyond-chance agreement between the diagnostic test and the gold standard. This parameter depends on the accuracy of the diagnostic test and also on the disease prevalence. This article studies the comparison of the average kappa coefficients of two binary diagnostic tests when the gold standard is not applied to all individuals in a random sample. In this situation, known as partial disease verification, the disease status of some individuals is a missing piece of data. Assuming that the missing data mechanism is missing at random, the comparison of the average kappa coefficients is solved by applying two computational methods: the EM algorithm and the SEM algorithm. With the EM algorithm the parameters are estimated and with the SEM algorithm their variances-covariances are estimated. Simulation experiments have been carried out to study the sizes and powers of the hypothesis tests studied, obtaining that the proposed method has good asymptotic behavior. A function has been written in R to solve the proposed problem, and the results obtained have been applied to the diagnosis of Alzheimer's disease.


Symmetry ◽  
2021 ◽  
Vol 13 (7) ◽  
pp. 1286
Author(s):  
Yenni Angraini ◽  
Khairil Anwar Notodiputro ◽  
Henk Folmer ◽  
Asep Saefuddin ◽  
Toni Toharudin

This paper deals with symmetrical data that can be modelled based on Gaussian distribution, such as linear mixed models for longitudinal data. The latent factor linear mixed model (LFLMM) is a method generally used for analysing changes in high-dimensional longitudinal data. It is usual that the model estimates are based on the expectation-maximization (EM) algorithm, but unfortunately, the algorithm does not produce the standard errors of the regression coefficients, which then hampers testing procedures. To fill in the gap, the Supplemented EM (SEM) algorithm for the case of fixed variables is proposed in this paper. The computational aspects of the SEM algorithm have been investigated by means of simulation. We also calculate the variance matrix of beta using the second moment as a benchmark to compare with the asymptotic variance matrix of beta of SEM. Both the second moment and SEM produce symmetrical results, the variance estimates of beta are getting smaller when number of subjects in the simulation increases. In addition, the practical usefulness of this work was illustrated using real data on political attitudes and behaviour in Flanders-Belgium.


2021 ◽  
Author(s):  
Nguyen Thi Ngan ◽  
Bui Huy Khoi
Keyword(s):  

2021 ◽  
Vol 1933 (1) ◽  
pp. 012066
Author(s):  
Nguyen Thi Ngan ◽  
Bui Huy Khoi
Keyword(s):  

2021 ◽  
Vol 1860 (1) ◽  
pp. 012013
Author(s):  
D S Mai ◽  
P H Hai ◽  
D T Cuong ◽  
B H Khoi
Keyword(s):  

2020 ◽  
Vol 27 (3) ◽  
pp. 53-64
Author(s):  
V. V. Glinskiy ◽  
Yu. N. Ismaiylova

The article summarizes research results of the study on the problem of assessing the differentiation level of socio-economic development of territorial units of the Russian Federation. The authors propose an approach to measuring differentiation using mixtures of probability distributions.This technique was developed and tested on real data that allows one to determine the presence or absence of interregional differentiation. The research hypothesis, that interterritorial differentiation is estimated by a specific statistical indicator selected based on a content, qualitative analysis, served as a theoretical platform of this methodology. Differentiation is practically absent if the entire statistical population is described by a single law of probability distribution. If the statistical population is described by a mixture of probability distributions, then one should expect the presence of a significant level of differentiation by the considered indicator.In mathematical statistics, the problem of separating a mixture of probability distributions (estimating parameters of distribution densities and weighting coefficients) is traditionally solved using several similar methods. For example, the expectation-maximization (EM) algorithm, median modifications of the EM-algorithm, SEM-algorithm, taking into account the specifics of the selected object (constituent entities of the Russian Federation a small sample). To solve this problem, the authors used the SEM algorithm. As the information base of the empirical study, official statistics were used (open data from the Federal State Statistics Service).The typologies of the constituent entities of the Russian Federation were identified based on two characteristics within 2005-2017-time interval. The first one being the level of violence (using the “homicide rate” indicator the number of homicides and attempted murders per 100000 population). And second average per capita income, which made it possible, among other things, to additionally test the hypothesis of the traditional use of differentiation trends in the level of violence as an indicator of economic inequality. According to the authors, the results of this study can be used as instrumental and informational support for managerial decisions aimed at regulating the differentiation of Russian regions by the level of violence and economic inequality.


Author(s):  
JORGE DOMINGUEZ-BLANCO ◽  
IGNACIO CASTRO-ABANCÉNS ◽  
GABRIEL CEPEDA-CARRION

This study explores and examines the relationship between the success of Research and Development (R&D) consortia and the factors that determine this success. Most studies of R&D consortium success are based on a set of observable variables as antecedents of this success. In our study, we recognise the complexity of the problem and use latent variables as a set of weighted observable variables, rather than classical variables. Empirical insights are provided by applying a second-generation technique, namely the PLS-SEM algorithm, to analyse the data gathered from R&D consortia in Spain. The results demonstrate the existence of constructs (partner attributes, alliance attributes, environment attributes and leadership) that encompass the determinant factors for success, and also show that the attributes of the partners and the characteristics of the project alliances exert a positive influence on the success of the joint R&D project.


Sign in / Sign up

Export Citation Format

Share Document