database integrity
Recently Published Documents


TOTAL DOCUMENTS

63
(FIVE YEARS 5)

H-INDEX

6
(FIVE YEARS 0)

2022 ◽  
Vol 0 (0) ◽  
Author(s):  
Jian Cao ◽  
Seo-young Silvia Kim ◽  
R. Michael Alvarez

Abstract How do we ensure a statewide voter registration database’s accuracy and integrity, especially when the database depends on aggregating decentralized, sub-state data with different list maintenance practices? We develop a Bayesian multivariate multilevel model to account for correlated patterns of change over time in multiple response variables, and label statewide anomalies using deviations from model predictions. We apply our model to California’s 22 million registered voters, using 25 snapshots from the 2020 presidential election. We estimate countywide change rates for multiple response variables such as changes in voter’s partisan affiliation and jointly model these changes. The model outperforms a simple interquartile range (IQR) detection when tested with synthetic data. This is a proof-of-concept that demonstrates the utility of the Bayesian methodology, as despite the heterogeneity in list maintenance practices, a principled, statistical approach is useful. At the county level, the total numbers of anomalies are positively correlated with the average election cost per registered voter between 2017 and 2019. Given the recent efforts to modernize and secure voter list maintenance procedures in the For the People Act of 2021, we argue that checking whether counties or municipalities are behaving similarly at the state level is also an essential step in ensuring electoral integrity.


2021 ◽  
Author(s):  
Jian Cao ◽  
Seo-young Silvia Kim ◽  
R. Michael Alvarez

How do we ensure a statewide voter registration database's accuracy and integrity, especially when the database depends on aggregating decentralized, sub-state data with different list maintenance practices? We develop a Bayesian multivariate multilevel model to account for correlated patterns of change over time in multiple response variables, and label statewide anomalies using deviations from model predictions. We apply our model to California's 22 million registered voters, using 25 snapshots from the 2020 presidential election. We estimate countywide change rates for multiple response variables such as changes in voter's partisan affiliation and jointly model these changes. The model outperforms a simple interquartile range (IQR) detection when tested with synthetic data. This is a proof-of-concept that demonstrates the utility of the Bayesian methodology, as despite the heterogeneity in list maintenance practices, a principled, statistical approach is useful. At the county level, the total numbers of anomalies are positively correlated with the average election cost per registered voter between 2017--2019. Given the recent efforts to modernize and secure voter list maintenance procedures in the For the People Act of 2021, we argue that checking whether counties or municipalities are behaving similarly at the state level is also an essential step in ensuring electoral integrity.


Forests ◽  
2021 ◽  
Vol 12 (5) ◽  
pp. 631
Author(s):  
Patrick N. McGovern ◽  
Yulia A. Kuzovkina ◽  
Raju Y. Soolanayakanahally

A variety of Salix L. (Willow) tree and shrub cultivars provide resources for significant commercial markets such as bioenergy, environmental applications, basket manufacturing, and ornamental selections. The International Poplar Commission of the Food and Agriculture Organization (IPC FAO) has maintained the Checklist for Cultivars of Salix L. (Willow) since 2015 and now lists 968 epithet records in a Microsoft Excel spreadsheet format. This Proof-of-Concept (POC) investigates using an SQL database to store existing IPC Salix cultivar information and provide users with a format to compare and submit new Salix cultivar entries. The original IPC data were divided into three separate tables: Epithet, Species, and Family. Then, the data were viewed from three different model perspectives: the original Salix IPC spreadsheet data, the Canadian (PWCC), and the Open4st database. Requirements for this process need to balance database integrity rules with the ease of adding new Salix cultivar entries. An integrated approach from all three models proposed three tables: Epithet, Family, and Pedigree. The Epithet and Family tables also included Species data with a reference to a website link for accepted species names and details. The integrated process provides a more robust method to store and report data, but would require dedicated IT personnel to implement and maintain long-term. A potential use case scenario could involve users submitting their Checklist entries to the Salix administrator for review; the entries are then entered into a test environment by IT resources for final review and promotion to a production online environment. Perhaps the most beneficial outcome of this study is the investigation of various strategies and standards for Epithet and Family recording processes, which may benefit the entire Populus and Salix communities.


2021 ◽  
Vol 245 ◽  
pp. 03038
Author(s):  
Yuan Liu

In the course of curriculum ideological and political integrated teaching concept design process, the main basis is “Living morality and fostering people”, which has a positive role in promoting the development of Chinese higher education. To this end, relevant staff should integrate ideological politics and daily course teaching. This article summarizes the curriculum teaching objectives and curriculum ideological and political objectives based on previous work experience. The author discusses the integration of ideological and political elements in the course “Database Technology and Application” from seven aspects. They including top-level design and planning, revision of curriculum syllabus, reform practice and promotion, “database security maintenance” teaching ideology and politics, “database integrity” teaching ideology and politics, ideological and political improvement of teacher construction, and ideological and political realization of teaching process.


2017 ◽  
Vol 73 (3) ◽  
pp. 211-222 ◽  
Author(s):  
Christian X. Weichenberger ◽  
Edwin Pozharski ◽  
Bernhard Rupp

Thede factocommoditization of biomolecular crystallography as a result of almost disruptive instrumentation automation and continuing improvement of software allows any sensibly trained structural biologist to conduct crystallographic studies of biomolecules with reasonably valid outcomes: that is, models based on properly interpreted electron density. Robust validation has led to major mistakes in the protein part of structure models becoming rare, but some depositions of protein–peptide complex structure models, which generally carry significant interest to the scientific community, still contain erroneous models of the bound peptide ligand. Here, the protein small-molecule ligand validation toolTwilightis updated to include peptide ligands. (i) The primary technical reasons and potential human factors leading to problems in ligand structure models are presented; (ii) a new method used to score peptide-ligand models is presented; (iii) a few instructive and specific examples, including an electron-density-based analysis of peptide-ligand structures that do not contain any ligands, are discussed in detail; (iv) means to avoid such mistakes and the implications for database integrity are discussed and (v) some suggestions as to how journal editors could help to expunge errors from the Protein Data Bank are provided.


2014 ◽  
Vol 8 (6) ◽  
pp. 25-40 ◽  
Author(s):  
Lancine Camara ◽  
Junyi Li ◽  
Renfa Li ◽  
Faustin Kagorora ◽  
Damien Hanyurwimfura

Sign in / Sign up

Export Citation Format

Share Document