The Estimation of Population Microdata by Using Data from Small Area Statistics and Samples of Anonymised Records

1998 ◽  
Vol 30 (5) ◽  
pp. 785-816 ◽  
Author(s):  
P Williamson ◽  
M Birkin ◽  
P H Rees

Census data can be represented both as lists and as tabulations of household/individual attributes. List representation of Census data offers greater flexibility, as the exploration of interrelationships between population characteristics is limited only by the quality and scope of the data collected. Unfortunately, the released lists of household/individual attributes (Samples of Anonymised Records, SARs) are spatially referenced only to areas (single or merged districts) with populations of 120 000 or more, whereas released tabulations are available for units as small as single enumeration districts (Small Area Statistics, SAS). Intuitively, it should be possible to derive list-based estimates of enumeration district populations by combining information contained in the SAR and the SAS. In this paper we explore the range of solutions that could be adapted to this problem which, ultimately, is presented as a complex combinatorial optimisation problem. Various techniques of combinatorial optimisation are tested, and preliminary results from the best performing algorithm are evaluated. Through this process, the lack of suitable test statistics for the comparison of observed and expected tabulations of population data is highlighted.

2020 ◽  
Vol 49 (Supplement_1) ◽  
pp. i15-i25
Author(s):  
Daniela Fecht ◽  
Samantha Cockings ◽  
Susan Hodgson ◽  
Frédéric B Piel ◽  
David Martin ◽  
...  

Abstract Temporally and spatially highly resolved information on population characteristics, including demographic profile (e.g. age and sex), ethnicity and socio-economic status (e.g. income, occupation, education), are essential for observational health studies at the small-area level. Time-relevant population data are critical as denominators for health statistics, analytics and epidemiology, to calculate rates or risks of disease. Demographic and socio-economic characteristics are key determinants of health and important confounders in the relationship between environmental contaminants and health. In many countries, census data have long been the source of small-area population denominators and confounder information. A strength of the traditional census model has been its careful design and high level of population coverage, allowing high-quality detailed data to be released for small areas periodically, e.g. every 10 years. The timeliness of data, however, becomes a challenge when temporally and spatially highly accurate annual (or even more frequent) data at high spatial resolution are needed, for example, for health surveillance and epidemiological studies. Additionally, the approach to collecting demographic population information is changing in the era of open and big data and may eventually evolve to using combinations of administrative and other data, supplemented by surveys. We discuss different approaches to address these challenges including (i) the US American Community Survey, a rolling sample of the US population census, (ii) the use of spatial analysis techniques to compile temporally and spatially high-resolution demographic data and (iii) the use of administrative and big data sources as proxies for demographic characteristics.


2015 ◽  
Vol 31 (3) ◽  
pp. 431-451 ◽  
Author(s):  
Dilek Yildiz ◽  
Peter W.F. Smith

Abstract Administrative data sources are an important component of population data collection and they have been used in census data production in the Nordic countries since the 1960s. A large amount of information about the population is already collected in administrative data sources by governments. However, there are some challenges to using administrative data sources to estimate population counts by age, sex, and geographical area as well as population characteristics. The main limitation with the administrative data sources is that they only collect information from a subset of the population about specific events, and this may result in either undercoverage or overcoverage of the population. Another issue with the administrative data sources is that the information may not have the same quality for all population groups. This research aims to correct an inaccurate administrative data source by combining aggregate-level administrative data with more accurate marginal distributions or two-way marginal information from an auxiliary data source and produce accurate population estimates in the absence of a traditional census. The methodology developed is applied to estimate population counts by age, sex, and local authority area in England and Wales. The administrative data source used is the Patient Register which suffers from overcoverage, particularly for people between the ages of 20 and 50.


2002 ◽  
Vol 34 (6) ◽  
pp. 1021-1035 ◽  
Author(s):  
Richard Mitchell ◽  
Danny Dorling ◽  
David Martin ◽  
Ludi Simpson

The 1991 UK Decennial Census missed about 1.2 million people. These missing individuals present a serious challenge to any census user interested in measuring intercensal change, particularly amongst the most marginalised groups in society who were prominent amongst the missing population. Recently, a web-based system for accessing census data from the 1971, 1981, and 1991 censuses was launched ( www.census.ac.uk/cdu/lct ). The ‘LCT’ package also provides access to a set of 1991 small area statistics (SAS) which have been corrected to compensate for the missing million. The authors explain the methods used for adjusting the SAS counts, provide examples of the differences between analysis with the adjusted and unadjusted data, and recommend the use of the new data set to all those interested in intercensal change.


1993 ◽  
Vol 25 (9) ◽  
pp. 1267-1277 ◽  
Author(s):  
C S Morphet

Census enumeration districts (EDs) are not ‘natural’ areal units, and a strong presumption exists against their use for cartographic or analytical purposes. There is, however, some reason to suppose that their boundaries may coincide with divisions in the underlying social geography of an area, and that they therefore contain information which should not be lost in processing. A study of Newcastle upon Tyne EDs shows that ED boundaries do not, in fact, mark the boundaries of distinct areas. The presumption against the use of ED boundaries for cartographic representation is reinforced.


2017 ◽  
Vol 12 (1) ◽  
Author(s):  
Azizur Rahman

Estimation of health-related characteristics at a fine local geographic level is vital for effective health promotion programmes, provision of better health services and population-specific health planning and management. Lack of a micro-dataset readily available for attributes of individuals at small areas negatively impacts the ability of local and national agencies to manage serious health issues and related risks in the community. A solution to this challenge would be to develop a method that simulates reliable small-area statistics. This paper provides a significant appraisal of the methodologies for estimating health-related characteristics of populations at geographical limited areas. Findings reveal that a range of methodologies are in use, which can be classified as three distinct set of approaches: i) indirect standardisation and individual level modelling; ii) multilevel statistical modelling; and iii) micro-simulation modelling. Although each approach has its own strengths and weaknesses, it appears that microsimulation- based spatial models have significant robustness over the other methods and also represent a more precise means of estimating health-related population characteristics over small areas.


1993 ◽  
Vol 32 (01) ◽  
pp. 79-81 ◽  
Author(s):  
P. Millard ◽  
S. McClean

Abstract:The flow of patients through geriatric hospitals has been previously described in terms of acute and long-stay states where the bed occupancy at a census point is modelled by a mixed exponential model. Using data for sixteen years the model was fitted to successive annual census points, in order to provide a description of temporal trends. While the number of acute patients has remained fairly stable during the period, the model shows that there has been a decrease in the number of long-stay patients. Mean lengths of stay in our geriatric hospital before death or discharge have decreased during the study period for both acute and long-stay patients.Using these fits of the mixed exponential model to census data, a method is provided for predicting future turnover of patients. These predictions are reasonably good, except when the turnover patterns go through a period of flux in which assumption of stability no longer holds. Overall, a methodology is presented which relates census analysis to the behaviour of admission cohorts, thus producing a means of predicting future behaviour of patients and identifying where there is a change in patterns.


Author(s):  
Brian Foley ◽  
Tony Champion ◽  
Ian Shuttleworth

AbstractThe paper compares and contrasts internal migration measured by healthcard-based administrative data with census figures. This is useful because the collection of population data, its processing, and its dissemination by statistical agencies is becoming more reliant on administrative data. Statistical agencies already use healthcard data to make migration estimates and are increasingly confident about local population estimates from administrative sources. This analysis goes further than this work as it assesses how far healthcard data can produce reliable data products of the kind to which academics are accustomed. It does this by examining migration events versus transitions over a full intercensal period; population flows into and out of small areas; and the extent to which it produces microdata on migration equivalent to that in the census. It is shown that for most demographic groups and places healthcard data is an adequate substitute for census-based migration counts, the exceptions being for student households and younger people. However, census-like information is still needed to provide covariates for analysis and this will still be required whatever the future of the traditional census.


ILR Review ◽  
1995 ◽  
Vol 48 (4) ◽  
pp. 792-811 ◽  
Author(s):  
Edward Funkhouser ◽  
Stephen J. Trejo

Using data from special supplements to the Current Population Survey (CPS), the authors track the education and hourly earnings of recent male immigrants to the United States. In terms of these measures of labor market skills, the CPS data suggest that immigrants who came in the late 1980s were more skilled than those who arrived earlier in the decade. This pattern represents a break from the steady decline in immigrant skill levels observed in 1940–80 Census data. Despite the encouraging trend over the 1980s, however, the average skills of recent immigrants remain low by historical standards.


Author(s):  
Nurkhalik Wahdanial Asbara

Technological developments and changes in government systems are developing rapidly. Both of these lead to efforts to carry out duties, protect functions and serve the community. This encourages the government to take various adjustment steps quickly in line with the dynamics of development that occur. One of them is through a population census. The population census is an important issue that must be handled properly. The population census in this study takes population data in an area based on the number of male population, female population, ratio, and population density. The data was taken and submitted to the Makassar City Statistics Agency. Population Census is a presentation of information that has the ability to present accurate information, and helps facilitate the search for a population census data. The population census is carried out every 5 years which is carried out by census officers to carry out data collection to each resident's house, the data collection process is carried out by conventional recording and submitting it to the central statistics agency for database entry. With this application, it is expected to provide convenience to Population census officers to perform the process of inputting population data and the data is directly stored in the database without having to return to the office to input again.


Sign in / Sign up

Export Citation Format

Share Document