database integrity Latest Research Papers

Bayesian Analysis of State Voter Registration Database Integrity

10.31219/osf.io/vnjbh ◽

2021 ◽

Author(s):

Jian Cao ◽

Seo-young Silvia Kim ◽

R. Michael Alvarez

Keyword(s):

Synthetic Data ◽

State Level ◽

Voter Registration ◽

Multiple Response ◽

Database Integrity ◽

The People ◽

Model Predictions ◽

Registered Voters ◽

Multivariate Multilevel Model ◽

Response Variables

How do we ensure a statewide voter registration database's accuracy and integrity, especially when the database depends on aggregating decentralized, sub-state data with different list maintenance practices? We develop a Bayesian multivariate multilevel model to account for correlated patterns of change over time in multiple response variables, and label statewide anomalies using deviations from model predictions. We apply our model to California's 22 million registered voters, using 25 snapshots from the 2020 presidential election. We estimate countywide change rates for multiple response variables, such as changes in voters' partisan affiliation, and jointly model these changes. The model outperforms a simple interquartile range (IQR) detection when tested with synthetic data. This proof of concept demonstrates the utility of the Bayesian methodology: despite the heterogeneity in list maintenance practices, a principled statistical approach is useful. At the county level, the total number of anomalies is positively correlated with the average election cost per registered voter between 2017 and 2019. Given the recent efforts to modernize and secure voter list maintenance procedures in the For the People Act of 2021, we argue that checking whether counties or municipalities are behaving similarly at the state level is also an essential step in ensuring electoral integrity.
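The abstract names a simple IQR detection as the baseline that the Bayesian model is compared against. The following is a minimal sketch of what such a baseline could look like, not the authors' implementation: the function name `iqr_outliers`, the conventional multiplier of 1.5, and the sample change rates are all assumptions made for illustration.

```python
from statistics import quantiles


def iqr_outliers(rates, k=1.5):
    """Return the indices of values outside [Q1 - k*IQR, Q3 + k*IQR].

    `rates` is a hypothetical list of per-county change rates for one
    snapshot; k=1.5 is the conventional fence multiplier (an assumption,
    the abstract does not state the value used).
    """
    q1, _, q3 = quantiles(rates, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [i for i, r in enumerate(rates) if r < lower or r > upper]


# Made-up change rates for seven counties; the last one is far from the rest.
rates = [0.010, 0.012, 0.011, 0.009, 0.013, 0.010, 0.080]
flagged = iqr_outliers(rates)
print(flagged)
```

A rule like this treats each response variable and each snapshot separately, which is the limitation the abstract's jointly modeled, multilevel approach is meant to address.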

Short Communication: IPC Salix Cultivar Database Proof-of-Concept

Forests ◽

10.3390/f12050631 ◽

2021 ◽

Vol 12 (5) ◽

pp. 631

Author(s):

Patrick N. McGovern ◽

Yulia A. Kuzovkina ◽

Raju Y. Soolanayakanahally

Keyword(s):

Integrated Approach ◽

Proof Of Concept ◽

Case Scenario ◽

Test Environment ◽

Food And Agriculture ◽

Database Integrity ◽

Food And Agriculture Organization ◽

Commercial Markets ◽

Report Data

A variety of Salix L. (Willow) tree and shrub cultivars provide resources for significant commercial markets such as bioenergy, environmental applications, basket manufacturing, and ornamental selections. The International Poplar Commission of the Food and Agriculture Organization (IPC FAO) has maintained the Checklist for Cultivars of Salix L. (Willow) since 2015 and now lists 968 epithet records in a Microsoft Excel spreadsheet format. This Proof-of-Concept (POC) investigates using an SQL database to store existing IPC Salix cultivar information and provide users with a format to compare and submit new Salix cultivar entries. The original IPC data were divided into three separate tables: Epithet, Species, and Family. Then, the data were viewed from three different model perspectives: the original Salix IPC spreadsheet data, the Canadian (PWCC), and the Open4st database. Requirements for this process need to balance database integrity rules with the ease of adding new Salix cultivar entries. An integrated approach from all three models proposed three tables: Epithet, Family, and Pedigree. The Epithet and Family tables also included Species data with a reference to a website link for accepted species names and details. The integrated process provides a more robust method to store and report data, but would require dedicated IT personnel to implement and maintain long-term. A potential use case scenario could involve users submitting their Checklist entries to the Salix administrator for review; the entries are then entered into a test environment by IT resources for final review and promotion to a production online environment. Perhaps the most beneficial outcome of this study is the investigation of various strategies and standards for Epithet and Family recording processes, which may benefit the entire Populus and Salix communities.
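The trade-off the abstract describes, between database integrity rules and ease of adding entries, can be illustrated with a small sketch of the three proposed tables. This is only a sketch under stated assumptions: the abstract gives the table names (Epithet, Family, Pedigree) but not their columns, so every column name, the sample rows, and the choice of SQLite via Python's `sqlite3` module are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite does not enforce foreign keys unless this is switched on.
conn.execute("PRAGMA foreign_keys = ON")

# Hypothetical columns; only the three table names come from the abstract.
conn.executescript("""
CREATE TABLE family (
    family_id INTEGER PRIMARY KEY,
    name      TEXT NOT NULL UNIQUE
);
CREATE TABLE epithet (
    epithet_id INTEGER PRIMARY KEY,
    name       TEXT NOT NULL UNIQUE,
    species    TEXT NOT NULL,
    family_id  INTEGER NOT NULL REFERENCES family(family_id)
);
CREATE TABLE pedigree (
    child_id  INTEGER NOT NULL REFERENCES epithet(epithet_id),
    parent_id INTEGER NOT NULL REFERENCES epithet(epithet_id),
    PRIMARY KEY (child_id, parent_id),
    CHECK (child_id <> parent_id)
);
""")


def try_insert(sql):
    """Run an INSERT and report whether the integrity rules accepted it."""
    try:
        conn.execute(sql)
        return True
    except sqlite3.IntegrityError:
        return False


# Placeholder rows, not real Checklist entries.
accepted = try_insert("INSERT INTO family (family_id, name) VALUES (1, 'Example family')")
accepted_epithet = try_insert(
    "INSERT INTO epithet (name, species, family_id) VALUES ('Example One', 'Salix sp.', 1)"
)
# This entry points at a family that does not exist, so it is rejected.
rejected = not try_insert(
    "INSERT INTO epithet (name, species, family_id) VALUES ('Orphan', 'Salix sp.', 999)"
)
epithet_count = conn.execute("SELECT COUNT(*) FROM epithet").fetchone()[0]
```

Stricter rules (`NOT NULL`, `UNIQUE`, foreign keys) catch inconsistent submissions early, but they also mean a new cultivar cannot be entered until its related records exist, which is the kind of friction a reviewed test environment would have to absorb.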

Research on the Integration of Ideological and Political Elements in the Course “Database Technology and Application”

E3S Web of Conferences ◽

10.1051/e3sconf/202124503038 ◽

2021 ◽

Vol 245 ◽

pp. 03038

Author(s):

Yuan Liu

Keyword(s):

Work Experience ◽

Database Security ◽

Chinese Higher Education ◽

Concept Design ◽

Positive Role ◽

Teaching Objectives ◽

Database Integrity ◽

Teaching Concept ◽

Database Technology ◽

Main Basis

The main basis for designing an integrated ideological and political teaching concept for a curriculum is “Living morality and fostering people”, which plays a positive role in promoting the development of Chinese higher education. To this end, relevant staff should integrate ideological and political education into daily course teaching. This article summarizes the curriculum teaching objectives and the curriculum's ideological and political objectives based on previous work experience. The author discusses the integration of ideological and political elements in the course “Database Technology and Application” from seven aspects: top-level design and planning, revision of the curriculum syllabus, reform practice and promotion, ideological and political teaching on “database security maintenance”, ideological and political teaching on “database integrity”, ideological and political improvement of teacher development, and ideological and political realization of the teaching process.

Securing Database Integrity in Intelligent Government Systems that Employ Fog Computing Technology

2020 International Conference on Computing and Data Science (CDS) ◽

10.1109/cds49703.2020.00048 ◽

2020 ◽

Author(s):

Brajendra Panda ◽

Abdulwahab Alazeb

Keyword(s):

Fog Computing ◽

Computing Technology ◽

Database Integrity

Database Integrity:

Ethical Programs ◽

10.2307/j.ctv65swg4.8 ◽

2018 ◽

pp. 103-133

Keyword(s):

Database Integrity

Leveraging Conceptual Data Models for Keeping Cassandra Database Integrity

Proceedings of the 14th International Conference on Web Information Systems and Technologies ◽

10.5220/0007236303980403 ◽

2018 ◽

Cited By ~ 1

Author(s):

Pablo Suárez-Otero ◽

Maria José Suárez-Cabal ◽

Javier Tuya

Keyword(s):

Data Models ◽

Database Integrity ◽

Conceptual Data

Twilight reloaded: the peptide experience

Acta Crystallographica Section D Structural Biology ◽

10.1107/s205979831601620x ◽

2017 ◽

Vol 73 (3) ◽

pp. 211-222 ◽

Cited By ~ 6

Author(s):

Christian X. Weichenberger ◽

Edwin Pozharski ◽

Bernhard Rupp

Keyword(s):

Electron Density ◽

Complex Structure ◽

Data Bank ◽

Peptide Ligands ◽

Peptide Ligand ◽

Database Integrity ◽

Biomolecular Crystallography ◽

Validation Tool ◽

Small Molecule Ligand ◽

Protein Part

The de facto commoditization of biomolecular crystallography as a result of almost disruptive instrumentation automation and continuing improvement of software allows any sensibly trained structural biologist to conduct crystallographic studies of biomolecules with reasonably valid outcomes: that is, models based on properly interpreted electron density. Robust validation has led to major mistakes in the protein part of structure models becoming rare, but some depositions of protein–peptide complex structure models, which generally carry significant interest to the scientific community, still contain erroneous models of the bound peptide ligand. Here, the protein small-molecule ligand validation tool Twilight is updated to include peptide ligands. (i) The primary technical reasons and potential human factors leading to problems in ligand structure models are presented; (ii) a new method used to score peptide-ligand models is presented; (iii) a few instructive and specific examples, including an electron-density-based analysis of peptide-ligand structures that do not contain any ligands, are discussed in detail; (iv) means to avoid such mistakes and the implications for database integrity are discussed and (v) some suggestions as to how journal editors could help to expunge errors from the Protein Data Bank are provided.
