A Case Study on Data Quality, Privacy, and Entity Resolution

This chapter presents ongoing research conducted through collaboration between the University of Arkansas at Little Rock and the Arkansas Department of Education to develop an entity resolution and identity management system. The process includes a multi-phase approach consisting of data-quality analysis, selection of entity-identity attributes for entity resolution, development of a truth-set, and implementation and benchmarking of an entity-resolution rule set using the open source entity-resolution system named OYSTER. The research is the first known of its kind to evaluate privacy-enhancing, entity-resolution rule sets in a state education agency.
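Benchmarking a rule set against a truth set is commonly done by comparing the record pairs each one links. The following is a minimal sketch of that pairwise comparison, assuming both the resolved output and the truth set are available as lists of record-ID sets; the function names and record IDs are hypothetical, and this is neither OYSTER's API nor necessarily the evaluation used in the chapter.

```python
from itertools import combinations


def linked_pairs(clusters):
    """Return the set of unordered record pairs that share a cluster."""
    pairs = set()
    for cluster in clusters:
        # Sorting gives each pair one canonical (a, b) ordering.
        for a, b in combinations(sorted(cluster), 2):
            pairs.add((a, b))
    return pairs


def pairwise_scores(resolved, truth):
    """Pairwise precision and recall of resolved clusters against a truth set."""
    predicted = linked_pairs(resolved)
    actual = linked_pairs(truth)
    true_pos = len(predicted & actual)
    precision = true_pos / len(predicted) if predicted else 1.0
    recall = true_pos / len(actual) if actual else 1.0
    return precision, recall


# Hypothetical record identifiers, for illustration only.
truth = [{"r1", "r2", "r3"}, {"r4"}, {"r5", "r6"}]
resolved = [{"r1", "r2"}, {"r3", "r4"}, {"r5", "r6"}]
precision, recall = pairwise_scores(resolved, truth)
```

Here the wrongly linked pair (r3, r4) lowers precision, while the missed links from r3 to r1 and r2 lower recall.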

A Case Study on Data Quality, Privacy, and Evaluating the Outcome of Entity Resolution Processes

International Journal of Organizational and Collective Intelligence ◽ 10.4018/ijoci.2016070101 ◽ 2016 ◽ Vol 6 (3) ◽ pp. 1-20

Author(s): Pei Wang ◽ Daniel Pullen ◽ Fan Liu ◽ William C. Decker ◽ Ningning Wu ◽ ...

Keyword(s): Data Quality ◽ Identity Management ◽ False Negative ◽ Entity Resolution ◽ Quality Analysis ◽ Ongoing Research ◽ State Education ◽ Resolution Rule ◽ Rule Sets ◽ Education Agency

This paper presents ongoing research conducted through collaboration between the University of Arkansas at Little Rock and the Arkansas Department of Education to develop an entity resolution and identity management system. The process follows a multi-phase approach consisting of data-quality analysis, selection of entity-identity attributes for entity resolution, definition of a rule set using the open source entity-resolution system named OYSTER, and use of an entropy approach to identify potential false positives and false negatives. The research is the first known of its kind to evaluate privacy-enhancing, entity-resolution rule sets in a state education agency.
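The abstract does not say how entropy is applied, so the following is only one plausible reading, sketched under stated assumptions: the Shannon entropy of an identity attribute is computed within each resolved cluster, and clusters whose values disagree strongly are flagged for review as potential false positives. The attribute, the sample values, and the threshold are all hypothetical, and the sketch does not cover false-negative detection.

```python
import math
from collections import Counter


def shannon_entropy(values):
    """Shannon entropy (in bits) of the value distribution in a non-empty list."""
    counts = Counter(values)
    total = len(values)
    return sum((n / total) * math.log2(total / n) for n in counts.values())


def flag_suspect_clusters(clusters, threshold):
    """Return IDs of clusters whose attribute entropy exceeds the threshold."""
    return [
        cluster_id
        for cluster_id, values in clusters.items()
        if shannon_entropy(values) > threshold
    ]


# Hypothetical clusters mapping a cluster ID to the birth dates of its records.
clusters = {
    "c1": ["2004-05-01", "2004-05-01", "2004-05-01"],  # values agree
    "c2": ["2004-05-01", "2011-09-17"],                # values conflict
}
suspects = flag_suspect_clusters(clusters, threshold=0.5)  # threshold is an assumption
```

A cluster whose records all agree has entropy 0, while two equally frequent conflicting values give 1 bit, so only the second cluster is flagged here.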

Data quality analysis of the station of geological and technological researches in recognizing losses and kicks to improve the prediction accuracy of neural network algorithms

Neftyanoe khozyaystvo - Oil Industry ◽ 10.24887/0028-2448-2020-8-63-67 ◽ 2020 ◽ pp. 63-67 ◽ Cited By ~ 1

Author(s): A.I. Arkhipov ◽ A.N. Dmitrievsky ◽ N.A. Eremin ◽ A.D. Chernikov ◽ ...

Keyword(s): Neural Network ◽ Data Quality ◽ Prediction Accuracy ◽ Quality Analysis ◽ Network Algorithms ◽ Data Quality Analysis

Free Appropriate Public Education, the U.S. Supreme Court, and Developing and Implementing Individualized Education Programs

Laws ◽ 10.3390/laws10020038 ◽ 2021 ◽ Vol 10 (2) ◽ pp. 38

Author(s): Michael Rozalski ◽ Mitchell L. Yell ◽ Jacob Warner

Keyword(s): Special Education ◽ Public Education ◽ Education Program ◽ Individualized Education ◽ Related Services ◽ Free Appropriate Public Education ◽ Substantive Content ◽ State Education ◽ Education Agency ◽ Education Act

In 1975, the Education for All Handicapped Children Act (renamed the Individuals with Disabilities Education Act in 1990) established the essential obligation of special education law, which is to develop a student’s individualized special education program that enables them to receive a free appropriate public education (FAPE). FAPE was defined in the federal law as special education and related services that: (a) are provided at public expense, (b) meet the standards of the state education agency, (c) include preschool, elementary, or secondary education, and (d) are provided in conformity with a student’s individualized education program (IEP). Thus, the IEP is the blueprint of an individual student’s FAPE. The importance of FAPE is reflected in the number of disputes that have arisen over the issue. In fact, 85% to 90% of all special education litigation involves disagreements over the FAPE that students receive. FAPE issues boil down to the process and content of a student’s IEP. In this article, we differentiate procedural (process) and substantive (content) violations and provide specific guidance on how to avoid both process and content errors when drafting and implementing students’ IEPs.

Linked Data Entity Resolution System Enhanced by Configuration Learning Algorithm

IEICE Transactions on Information and Systems ◽ 10.1587/transinf.2015edp7392 ◽ 2016 ◽ Vol E99.D (6) ◽ pp. 1521-1530 ◽ Cited By ~ 5

Author(s): Khai NGUYEN ◽ Ryutaro ICHISE

Keyword(s): Linked Data ◽ Learning Algorithm ◽ Entity Resolution ◽ Resolution System

A Comprehensive State Education Agency Plan to Promote the Integration of Students with Moderate/Severe Handicaps

Journal of the Association for Persons with Severe Handicaps ◽ 10.1177/154079699001500207 ◽ 1990 ◽ Vol 15 (2) ◽ pp. 106-113 ◽ Cited By ~ 6

Author(s): Susan Hamre-Nietupski ◽ John Nietupski ◽ Steve Maurer

Keyword(s): State Education Agency ◽ State Education ◽ Education Agency ◽ Severe Handicaps

An Algebraic Approach to Data Quality Metrics for Entity Resolution over Large Datasets

Data Warehousing and Mining ◽ 10.4018/978-1-59904-951-9.ch196 ◽ 2008 ◽ pp. 3067-3084

Author(s): John Talburt ◽ Richard Wang ◽ Kimberly Hess ◽ Emily Kuo

Keyword(s): Data Quality ◽ Algebraic Approach ◽ Entity Resolution ◽ Quality Metrics ◽ Quality Literature ◽ Intrinsic Quality ◽ Recognition Systems ◽ Entity Identification ◽ Data Quality Metrics

This chapter introduces abstract algebra as a means of understanding and creating data quality metrics for entity resolution, the process in which records determined to represent the same real-world entity are successively located and merged. Entity resolution is a particular form of data mining that is foundational to a number of applications in both industry and government. Examples include commercial customer recognition systems and information sharing on “persons of interest” across federal intelligence agencies. Despite the importance of these applications, most of the data quality literature focuses on measuring the intrinsic quality of individual records rather than the quality of record grouping or integration. In this chapter, the authors describe current research into the creation and validation of quality metrics for entity resolution, primarily in the context of customer recognition systems. The approach is based on an algebraic view of the system as creating a partition of a set of entity records based on the indicative information for the entities in question. In this view, the relative quality of entity identification between two systems can be measured in terms of the similarity between the partitions they produce. The authors discuss the difficulty of applying statistical cluster analysis to this problem when the datasets are large and propose an alternative index suitable for these situations. They also report some preliminary experimental results and outline areas and approaches for further research.
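To make the partition view concrete, the sketch below compares the partitions produced by two systems over the same records using a simple overlap-count similarity: the geometric mean of the two block counts divided by the number of non-empty block intersections. This measure is chosen here for illustration because it needs only one pass over the records; the abstract does not define the authors' index, so it should not be read as their exact formulation. Function names and record IDs are hypothetical, and both partitions are assumed to be non-empty.

```python
import math


def block_labels(partition):
    """Map each record to the position of the block that contains it."""
    return {record: i for i, block in enumerate(partition) for record in block}


def overlap_index(partition_a, partition_b):
    """Overlap-count similarity of two partitions of the same record set.

    Returns 1.0 when the partitions are identical and a smaller value
    as their blocks fragment each other.
    """
    labels_a = block_labels(partition_a)
    labels_b = block_labels(partition_b)
    if labels_a.keys() != labels_b.keys():
        raise ValueError("both partitions must cover the same records")
    # Each distinct (block in A, block in B) pair is one non-empty overlap.
    overlaps = {(labels_a[r], labels_b[r]) for r in labels_a}
    return math.sqrt(len(partition_a) * len(partition_b)) / len(overlaps)


# Hypothetical outputs of two entity-resolution systems on five records.
system_a = [{"r1", "r2"}, {"r3"}, {"r4", "r5"}]
system_b = [{"r1", "r2", "r3"}, {"r4"}, {"r5"}]
similarity = overlap_index(system_a, system_b)
```

Because the overlaps are collected from per-record labels, the cost grows with the number of records and not with the number of record pairs, which is the practical concern the abstract raises for large datasets.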
