Repeated and Combined Participation in Workforce, Education and Social Program Services: Preliminary Evidence from Maryland Administrative Data Sources

2012 ◽  
Author(s):  
Ting Zhang ◽  
David W. Stevens
2021 ◽  
pp. 1-22
Author(s):  
Emily Berg ◽  
Johgho Im ◽  
Zhengyuan Zhu ◽  
Colin Lewis-Beck ◽  
Jie Li

Statistical and administrative agencies often collect information on related parameters. Discrepancies between estimates from distinct data sources can arise due to differences in definitions, reference periods, and data collection protocols. Integrating statistical data with administrative data is appealing for saving data collection costs, reducing respondent burden, and improving the coherence of estimates produced by statistical and administrative agencies. Model based techniques, such as small area estimation and measurement error models, for combining multiple data sources have benefits of transparency, reproducibility, and the ability to provide an estimated uncertainty. Issues associated with integrating statistical data with administrative data are discussed in the context of data from Namibia. The national statistical agency in Namibia produces estimates of crop area using data from probability samples. Simultaneously, the Namibia Ministry of Agriculture, Water, and Forestry obtains crop area estimates through extension programs. We illustrate the use of a structural measurement error model for the purpose of synthesizing the administrative and survey data to form a unified estimate of crop area. Limitations on the available data preclude us from conducting a genuine, thorough application. Nonetheless, our illustration of methodology holds potential use for a general practitioner.


Author(s):  
Jonathan M Snowden ◽  
Audrey Lyndon ◽  
Peiyi Kan ◽  
Alison El Ayadi ◽  
Elliott Main ◽  
...  

Abstract Severe maternal morbidity (SMM) is a composite outcome measure that indicates serious, potentially life-threatening maternal health problems. There is great interest in defining SMM using administrative data for surveillance and research. In the US, one common way of defining SMM at the population level is an index developed by the Centers for Disease Control and Prevention. Modifications have been proposed to this index (e.g., excluding maternal transfusion); some research defines SMM using an index introduced by Bateman et al. Birth certificate data are also increasingly being used to define SMM. We compared commonly used US definitions of SMM to each other among all California births, 2007-2012, using the Kappa statistic and other measures. We also evaluated agreement between maternal morbidity fields on the birth certificate compared to claims data. Concordance was generally low between the 7 definitions of SMM analyzed (i.e., κ < 0.4 for 13 of 21 two-way comparisons), Low concordance was particularly driven by presence/absence of transfusion and claims data versus birth certificate definitions. Low agreement between administrative data-based definitions of SMM highlights that results can be expected to differ between them. Further research is needed on validity of SMM definitions, using more fine-grained data sources.


2014 ◽  
Vol 9 (1) ◽  
pp. 12-24
Author(s):  
Michael Comerford

The plethora of new data sources, combined with a growing interest in increased access to previously unpublished data, poses a set of ethical challenges regarding individual privacy. This paper sets out one aspect of those challenges: the need to anonymise data in such a form that protects the privacy of individuals while providing sufficient data utility for data users. This issue is discussed using a case study of Scottish Government’s administrative data, in which disclosure risk is examined and data utility is assessed using a potential ‘real-world’ analysis.


2015 ◽  
Vol 31 (3) ◽  
pp. 415-429 ◽  
Author(s):  
Loredana Di Consiglio ◽  
Tiziana Tuoto

Abstract The Capture-recapture method is a well-known solution for evaluating the unknown size of a population. Administrative data represent sources of independent counts of a population and can be jointly exploited for applying the capture-recapture method. Of course, administrative sources are affected by over- or undercoverage when considered separately. The standard Petersen approach is based on strong assumptions, including perfect record linkage between lists. In reality, record linkage results can be affected by errors. A simple method for achieving linkage error-unbiased population total estimates is proposed in Ding and Fienberg (1994). In this article, an extension of the Ding and Fienberg model by relaxing their conditions is proposed. The procedures are illustrated for estimating the total number of road casualties, on the basis of a probabilistic record linkage between two administrative data sources. Moreover, a simulation study is developed, providing evidence that the adjusted estimator always performs better than the Petersen estimator.


2007 ◽  
Vol 15 (4) ◽  
pp. 411-424 ◽  
Author(s):  
Anneli Uusküla ◽  
Kristiina Rajaleid ◽  
Ave Talu ◽  
Katri Abel ◽  
Kristi Rüütel ◽  
...  

2015 ◽  
Vol 31 (3) ◽  
pp. 431-451 ◽  
Author(s):  
Dilek Yildiz ◽  
Peter W.F. Smith

Abstract Administrative data sources are an important component of population data collection and they have been used in census data production in the Nordic countries since the 1960s. A large amount of information about the population is already collected in administrative data sources by governments. However, there are some challenges to using administrative data sources to estimate population counts by age, sex, and geographical area as well as population characteristics. The main limitation with the administrative data sources is that they only collect information from a subset of the population about specific events, and this may result in either undercoverage or overcoverage of the population. Another issue with the administrative data sources is that the information may not have the same quality for all population groups. This research aims to correct an inaccurate administrative data source by combining aggregate-level administrative data with more accurate marginal distributions or two-way marginal information from an auxiliary data source and produce accurate population estimates in the absence of a traditional census. The methodology developed is applied to estimate population counts by age, sex, and local authority area in England and Wales. The administrative data source used is the Patient Register which suffers from overcoverage, particularly for people between the ages of 20 and 50.


Author(s):  
Nadine Bachbauer

BackgroundNEPS-SC6-ADIAB is a new linked data product containing survey data of Starting Cohort 6 of the German National Educational Panel Study (NEPS) and administrative employment data from the Institute for Employment Research (IAB), the research institute of the Federal Employment Agency. NEPS is provided by the Leibniz Institute for Educational Trajectories (LIfBi). Starting Cohort 6 of this panel survey includes adults in their professional life, the survey focuses on education in adulthood and lifelong learning. The administrative data in NEPS-SC6-ADIAB consist of comprehensive information on the employment histories. ObjectivesCombining these two data sources increases for example the information about individual employment history. Overall, the data volume is increased by the linkage between the survey data and the administrative data. MethodsA record linkage process was used to link the two data sources. The data access is free for the whole scientific community. In addition to a large number of On-site access locations within Germany, there are also international On-site access locations. Including London and Colchester. In addition a Remote Data Access is offered. ConclusionsThis data linkage project is very innovative and creates an extensive database, which results in extensive analytical potential. A short application example is made to exemplify the comprehensive analytical potential of NEPS-SC6-ADIAB. This ongoing project deals with nonresponse in survey data. The linked data has a variety of variables collected in both data sources, administratively and through the NEPS survey, allowing for comparative analyses. In this case an idea to compensate nonresponse in income data with administrative data is drawn.


Author(s):  
Catherine Eastwood ◽  
Keith Denny ◽  
Maureen Kelly ◽  
Hude Quan

Theme: Data and Linkage QualityObjectives: To define health data quality from clinical, data science, and health system perspectives To describe some of the international best practices related to quality and how they are being applied to Canada’s administrative health data. To compare methods for health data quality assessment and improvement in Canada (automated logical checks, chart quality indicators, reabstraction studies, coding manager perspectives) To highlight how data linkage can be used to provide new insights into the quality of original data sources To highlight current international initiatives for improving coded data quality including results from current ICD-11 field trials Dr. Keith Denny: Director of Clinical Data Standards and Quality, Canadian Insititute for Health Information (CIHI), Adjunct Research Professor, Carleton University, Ottawa, ON. He provides leadership for CIHI’s information quality initiatives and for the development and application of clinical classifications and terminology standards. Maureen Kelly: Manager of Information Quality at CIHI, Ottawa, ON. She leads CIHI’s corporate quality program that is focused on enhancing the quality of CIHI’s data sources and information products and to fostering CIHI’s quality culture. Dr. Cathy Eastwood: Scientific Manager, Associate Director of Alberta SPOR Methods & Development Platform, Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB. She has expertise in clinical data collection, evaluation of local and systemic data quality issues, disease classification coding with ICD-10 and ICD-11. Dr. Hude Quan: Professor, Community Health Sciences, Cumming School of Medicine, University of Calgary, Director Alberta SPOR Methods Platform; Co-Chair of Hypertension Canada, Co-Chair of Person to Population Health Collaborative of the Libin Cardiovascular Institute in Calgary, AB. He has expertise in assessing, validating, and linking administrative data sources for conducting data science research including artificial intelligence methods for evaluating and improving data quality. Intended Outcomes:“What is quality health data?” The panel of experts will address this common question by discussing how to define high quality health data, and measures being taken to ensure that they are available in Canada. Optimizing the quality of clinical-administrative data, and their use-value, first requires an understanding of the processes used to create the data. Subsequently, we can address the limitations in data collection and use these data for diverse applications. Current advances in digital data collection are providing more solutions to improve health data quality at lower cost. This panel will describe a number of quality assessment and improvement initiatives aimed at ensuring that health data are fit for a range of secondary uses including data linkage. It will also discuss how the need for the linkage and integration of data sources can influence the views of the data source’s fitness for use. CIHI content will include: Methods for optimizing the value of clinical-administrative data CIHI Information Quality Framework Reabstraction studies (e.g. physician documentation/coders’ experiences) Linkage analytics for data quality University of Calgary content will include: Defining/measuring health data quality Automated methods for quality assessment and improvement ICD-11 features and coding practices Electronic health record initiatives


2015 ◽  
Vol 25 (4) ◽  
pp. 360-369 ◽  
Author(s):  
S. O'Donnell ◽  
S. Vanderloo ◽  
L. McRae ◽  
J. Onysko ◽  
S. B. Patten ◽  
...  

Background.To compare trends in the estimated prevalence of mood and/or anxiety disorders identified from two data sources (self-report and administrative). Reviewing, synthesising and interpreting data from these two sources will help identify potential factors that underlie the observed estimates and inform public health action.Method.We used self-reported, diagnosed mood and/or anxiety disorder cases from the Canadian Community Health Survey (CCHS) across a 5-year span (from 2003 to 2009) to estimate the prevalence among the Canadian population aged ≥15 years. We also estimated the prevalence of mood and/or anxiety disorders using the Canadian Chronic Disease Surveillance System (CCDSS), which identified cases using ICD-9/-10-CA codes from physician billing claims and hospital discharge records during the same time period. The prevalence rates for mood and/or anxiety disorders were compared across the CCHS and CCDSS by age and sex for all available years of data from 2003 to 2009. Summary rates were age-standardised to the Canadian population as of 1 October 1991.Results.In 2009, the prevalence of mood and/or anxiety disorders was 9.4% using self-reported data v. 11.3% using administrative data. Prevalence rates obtained from administrative data were consistently higher than those from self-report for both men and women. However, due to an increase in the prevalence of self-reported cases, these differences decreased over time (rate ratios for both sexes: 1.6–1.2). Prevalence estimates were consistently higher among females compared with males irrespective of data source. While differences in the prevalence estimates between the two data sources were evident across all age groups, the reduction of these differences was greater among adolescent, young and middle-aged adults compared with those 70 years and older.Conclusions.The overall narrowing of differences over time reflects a convergence of information regarding the prevalence of mood and/or anxiety disorders trends between self-report and administrative data sources. While the administrative data-based prevalences remained relatively stable, the self-reported prevalences increased over time. These observations may reflect positive societal changes in the perceptions of mental health (declining stigma) and/or increasing mental health literacy. Additional research using non-ecological data is required to further our understanding of the observed findings and trends, including a data linkage exercise permitting a comparison of prevalence estimates and population characteristics from these two data sources both separately and merged.


Sign in / Sign up

Export Citation Format

Share Document