Comparing Fatal Encounters, Mapping Police Violence, and Washington Post Fatal Police Shooting Data from 2015–2019: A Research Note

2022 ◽  
pp. 073401682110710
Author(s):  
Benjamin P. Comer ◽  
Jason R. Ingram

This research note empirically assessed similarities and differences among three open-source data sets from 2015-2019. Fatal police shooting incidents were compared across Washington Post, Mapping Police Violence, and Fatal Encounters data over a five-year period. One-way ANOVAs, bivariate correlations, and proportional percentage differences were used to examine mean differences, correlational strength, and yearly percentage difference trends. No significant mean differences were observed between Fatal Encounters, Mapping Police Violence, and Washington Post. With one exception, bivariate correlations between all three data source dyads were consistently strong. Percentage difference comparisons among data source dyads, however, revealed that the sources are becoming more dissimilar in their reporting of fatal shootings over time. Our results complement existing literature that has compared open-source police shooting data to government sources and suggest that the three data sources were strongly associated with one another from 2015-2019. Increasing differences between sources, however, necessitate continued inspection of the data across the various open-source platforms over time.

2015 ◽  
Vol 32 (18) ◽  
pp. 2693-2723 ◽  
Author(s):  
William S. Parkin ◽  
Jeff Gruenewald

To date, no discussion has taken place in the social sciences as to the appropriateness of using open-source data to augment, or replace, official data sources in homicide research. The purpose of this article is to examine whether open-source data have the potential to be used as a valid and reliable data source in testing theory and studying homicide. Official and open-source homicide data were collected as a case study in a single jurisdiction over a 1-year period. The data sets were compared to determine whether open-sources could recreate the population of homicides and variable responses collected in official data. Open-source data were able to replicate the population of homicides identified in the official data. Also, for every variable measured, the open-sources captured as much, or more, of the information presented in the official data. Also, variables not available in official data, but potentially useful for testing theory, were identified in open-sources. The results of the case study show that open-source data are potentially as effective as official data in identifying individual- and situational-level characteristics, provide access to variables not found in official homicide data, and offer geographic data that can be used to link macro-level characteristics to homicide events.


2020 ◽  
Vol 11 (05) ◽  
pp. 769-784
Author(s):  
Ipek Ensari ◽  
Adrienne Pichon ◽  
Sharon Lipsky-Gorman ◽  
Suzanne Bakken ◽  
Noémie Elhadad

Abstract Background Self-tracking through mobile health technology can augment the electronic health record (EHR) as an additional data source by providing direct patient input. This can be particularly useful in the context of enigmatic diseases and further promote patient engagement. Objectives This study aimed to investigate the additional information that can be gained through direct patient input on poorly understood diseases, beyond what is already documented in the EHR. Methods This was an observational study including two samples with a clinically confirmed endometriosis diagnosis. We analyzed data from 6,925 women with endometriosis using a research app for tracking endometriosis to assess prevalence of self-reported pain problems, between- and within-person variability in pain over time, endometriosis-affected tasks of daily function, and self-management strategies. We analyzed data from 4,389 patients identified through a large metropolitan hospital EHR to compare pain problems with the self-tracking app and to identify unique data elements that can be contributed via patient self-tracking. Results Pelvic pain was the most prevalent problem in the self-tracking sample (57.3%), followed by gastrointestinal-related (55.9%) and lower back (49.2%) pain. Unique problems that were captured by self-tracking included pain in ovaries (43.7%) and uterus (37.2%). Pain experience was highly variable both across and within participants over time. Within-person variation accounted for 58% of the total variance in pain scores, and was large in magnitude, based on the ratio of within- to between-person variability (0.92) and the intraclass correlation (0.42). Work was the most affected daily function task (49%), and there was significant within- and between-person variability in self-management effectiveness. Prevalence rates in the EHR were significantly lower, with abdominal pain being the most prevalent (36.5%). Conclusion For enigmatic diseases, patient self-tracking as an additional data source complementary to EHR can enable learning from the patient to more accurately and comprehensively evaluate patient health history and status.


Author(s):  
Ned Augenblick ◽  
Matthew Rabin

Abstract When a Bayesian learns new information and changes her beliefs, she must on average become concomitantly more certain about the state of the world. Consequently, it is rare for a Bayesian to frequently shift beliefs substantially while remaining relatively uncertain, or, conversely, become very confident with relatively little belief movement. We formalize this intuition by developing specific measures of movement and uncertainty reduction given a Bayesian’s changing beliefs over time, showing that these measures are equal in expectation and creating consequent statistical tests for Bayesianess. We then show connections between these two core concepts and four common psychological biases, suggesting that the test might be particularly good at detecting these biases. We provide support for this conclusion by simulating the performance of our test and other martingale tests. Finally, we apply our test to data sets of individual, algorithmic, and market beliefs.


2020 ◽  
Vol 122 (11) ◽  
pp. 1-32
Author(s):  
Michael A. Gottfried ◽  
Vi-Nhuan Le ◽  
J. Jacob Kirksey

Background It is of grave concern that kindergartners are missing more school than students in any other year of elementary school; therefore, documenting which students are absent and for how long is of upmost importance. Yet, doing so for students with disabilities (SWDs) has received little attention. This study addresses this gap by examining two cohorts of SWDs, separated by more than a decade, to document changes in attendance patterns. Research Questions First, for SWDs, has the number of school days missed or chronic absenteeism rates changed over time? Second, how are changes in the number of school days missed and chronic absenteeism rates related to changes in academic emphasis, presence of teacher aides, SWD-specific teacher training, and preschool participation? Subjects This study uses data from the Early Childhood Longitudinal Study (ECLS), a nationally representative data set of children in kindergarten. We rely on both ECLS data sets— the kindergarten classes of 1998–1999 and 2010–2011. Measures were identical in both data sets, making it feasible to compare children across the two cohorts. Given identical measures, we combined the data sets into a single data set with an indicator for being in the older cohort. Research Design This study examined two sets of outcomes: The first was number of days absent, and the second was likelihood of being chronically absent. These outcomes were regressed on a measure for being in the older cohort (our key measure for changes over time) and numerous control variables. The error term was clustered by classroom. Findings We found that SWDs are absent more often now than they were a decade earlier, and this growth in absenteeism was larger than what students without disabilities experienced. Absenteeism among SWDs was higher for those enrolled in full-day kindergarten, although having attended center-based care mitigates this disparity over time. Implications are discussed. Conclusions Our study calls for additional attention and supports to combat the increasing rates of absenteeism for SWDs over time. Understanding contextual shifts and trends in rates of absenteeism for SWDs in kindergarten is pertinent to crafting effective interventions and research geared toward supporting the academic and social needs of these students.


2018 ◽  
Vol 70 (6) ◽  
pp. 691-707 ◽  
Author(s):  
Daniel Torres-Salinas ◽  
Juan Gorraiz ◽  
Nicolas Robinson-Garcia

Purpose The purpose of this paper is to analyze the capabilities, functionalities and appropriateness of Altmetric.com as a data source for the bibliometric analysis of books in comparison to PlumX. Design/methodology/approach The authors perform an exploratory analysis on the metrics the Altmetric Explorer for Institutions, platform offers for books. The authors use two distinct data sets of books. On the one hand, the authors analyze the Book Collection included in Altmetric.com. On the other hand, the authors use Clarivate’s Master Book List, to analyze Altmetric.com’s capabilities to download and merge data with external databases. Finally, the authors compare the findings with those obtained in a previous study performed in PlumX. Findings Altmetric.com combines and orderly tracks a set of data sources combined by DOI identifiers to retrieve metadata from books, being Google Books its main provider. It also retrieves information from commercial publishers and from some Open Access initiatives, including those led by university libraries, such as Harvard Library. We find issues with linkages between records and mentions or ISBN discrepancies. Furthermore, the authors find that automatic bots affect greatly Wikipedia mentions to books. The comparison with PlumX suggests that none of these tools provide a complete picture of the social attention generated by books and are rather complementary than comparable tools. Practical implications This study targets different audience which can benefit from the findings. First, bibliometricians and researchers who seek for alternative sources to develop bibliometric analyses of books, with a special focus on the Social Sciences and Humanities fields. Second, librarians and research managers who are the main clients to which these tools are directed. Third, Altmetric.com itself as well as other altmetric providers who might get a better understanding of the limitations users encounter and improve this promising tool. Originality/value This is the first study to analyze Altmetric.com’s functionalities and capabilities for providing metric data for books and to compare results from this platform, with those obtained via PlumX.


2009 ◽  
Vol 9 (5) ◽  
pp. 22177-22222
Author(s):  
L. W. Thomason ◽  
J. R. Moore ◽  
M. C. Pitts ◽  
J. M. Zawodny ◽  
E.-W. Chiou

Abstract. Herein, we provide an assessment of the data quality of Stratospheric Aerosol and Gas Experiment (SAGE III) Version 4 aerosol extinction coefficient and water vapor data products. The evaluation is based on comparisons with data from four instruments: SAGE II, the Polar Ozone and Aerosol Measurement (POAM III), the Halogen Occultation Experiment (HALOE), and the Microwave Limb Sounder (MLS). Since only about half of the SAGE III channels have a direct comparison with measurements by other instruments, we have employed some empirical techniques to evaluate measurements at some wavelengths. We find that the aerosol extinction coefficient measurements at 449, 520, 755, 869, and 1021 nm are reliable with accuracies and precisions on the order of 10% in the primary aerosol range of 15 to 25 km. We also believe this to be true of the aerosol measurements at 1545 nm though we cannot exclude some positive bias below 15 km. We recommend use of the 385 nm measurements above 16 km where the accuracy is on par with other aerosol channels. The 601 nm measurement is much noisier (~20%) than other channels and we suggest caution in the use of these data. We believe that the 676 nm data are clearly defective particularly above 20 km (accuracy as poor as 50%) and the precision is also low (~30%). We suggest excluding this channel under most circumstances. The SAGE III Version 4 water vapor data product appears to be high quality and is recommended for science applications in the stratosphere below 45 km. In this altitude range, the mean differences with all four corroborative data sets are no bigger than 15% and often less than 10% with exceptional agreement with POAM III and MLS. Above 45 km, it seems likely that SAGE III water vapor values are increasingly too large and should be used cautiously or avoided. We believe that SAGE III meets its preflight goal of 15% accuracy and 10% precision between 15 and 45 km. We do not currently recommend limiting the SAGE III water vapor data utility in the stratosphere by aerosol loading.


Author(s):  
Christian Olalla-Soler ◽  
Javier Franco Aixelá ◽  
Sara Rovira-Esteva

This article identifies the specific characteristics of Cognitive Translation and Interpreting Studies (CTIS) as a branch of Translation and Interpreting Studies (TIS), adopting a bibliometric approach. The main data source for this study was the Bibliography of Interpreting and Translation (BITRA), which – as at September 2019 – included more than 77,000 TIS records, covering the diversity of languages and document types used in TIS research. BTRA is the only TIS database to feature citing information. CTIS-related records were analysed, and those published between 1976 and 2015 were compared with the whole corpus of TIS research output for the same period – again, as registered in BITRA. Specifically, we analysed: (a) the general features and evolution of CTIS publications over time (by thematic co-occurrence, by title content words, by format and by language); (b) authorship, focusing on co-authorship and on the most productive authors; (c) the citation patterns of CTIS documents, including a brief analysis of its most cited authors and publications; and (d) CTIS accessibility through a study of the ratio of documents published in open access. These aspects were analysed both synchronically and diachronically so as to describe CTIS as a whole and to identify any changes over time. Our results yield a first overview of CTIS from a bibliometric perspective and provide a methodological point of departure for future bibliometric studies in this area.


2020 ◽  
Author(s):  
Ana Sofía M. Uzsoy ◽  
Parsa Zareiesfandabadi ◽  
Jamie Jennings ◽  
Alexander F. Kemper ◽  
Mary Williard Elting

The mitotic spindle is a microtubule-based machine that pulls the two identical sets of chromosomes to opposite ends of the cell during cell division. The fission yeast Schizosaccharomyces pombe is an important model organism for studying mitosis due to its simple, stereotyped spindle structure and well-established genetic toolset. S. pombe spindle length is a useful metric for mitotic progression, but manually tracking spindle ends in each frame to measure spindle length over time is laborious and can limit experimental throughput. We have developed an ImageJ plugin that can automatically track S. pombe spindle length over time and replace manual or semi-automated tracking of spindle elongation dynamics. Using an algorithm that detects the principal axis of the spindle and then finds its ends, we reliably track the length and angle of the spindle as the cell divides. The plugin integrates with existing ImageJ features, exports its data for further analysis outside of ImageJ, and does not require any programming by the user. Thus, the plugin provides an accessible tool for quantification of S. pombe spindle length that will allow automatic analysis of large microscopy data sets and facilitate screening for effects of cell biological perturbations on mitotic progression.


Author(s):  
Sacha J. van Albada ◽  
Jari Pronold ◽  
Alexander van Meegen ◽  
Markus Diesmann

AbstractWe are entering an age of ‘big’ computational neuroscience, in which neural network models are increasing in size and in numbers of underlying data sets. Consolidating the zoo of models into large-scale models simultaneously consistent with a wide range of data is only possible through the effort of large teams, which can be spread across multiple research institutions. To ensure that computational neuroscientists can build on each other’s work, it is important to make models publicly available as well-documented code. This chapter describes such an open-source model, which relates the connectivity structure of all vision-related cortical areas of the macaque monkey with their resting-state dynamics. We give a brief overview of how to use the executable model specification, which employs NEST as simulation engine, and show its runtime scaling. The solutions found serve as an example for organizing the workflow of future models from the raw experimental data to the visualization of the results, expose the challenges, and give guidance for the construction of an ICT infrastructure for neuroscience.


Rangifer ◽  
2019 ◽  
Vol 39 (1) ◽  
pp. 43-58
Author(s):  
Alexander K. Prichard ◽  
Ryan L. Klimstra ◽  
Brian T. Person ◽  
Lincoln S. Parrett

With industrial development expanding in the Arctic, there is increasing interest in quantifying the impacts of development projects on barren ground caribou (Rangifer tarandus granti). The primary data source to assess caribou distribution and predict impacts in remote areas of Alaska has shifted in recent decades from aerial survey data to telemetry data, but these techniques have different strengths and weaknesses. The ranges of two caribou herds, the Western Arctic Herd and the Teshekpuk Herd, overlap in northwest Alaska between Wainwright and Atqasuk, Alaska. Based on long-term telemetry data sets, this region was thought to be outside of the core calving ranges of both herds. Calving has long been reported to occur in this general area, but early reports assumed caribou were from the Western Arctic Herd and only one systematic aerial survey of caribou density and distribution during calving has been conducted in this area in recent decades. Following interest in industrial development in this area, we conducted aerial strip-transect surveys during early to mid-June 2013–2015 to directly assess the density and distribution of caribou in the area and we used existing telemetry data to compare our results to the seasonal distribution of both herds. Total caribou densities varied between 0.36 and 1.06 caribou/km² among years, and calf densities varied 0.04 and 0.25 calves/km² among years. Contrary to assumptions by early researchers in the area, telemetry data indicated that caribou in this area during early to mid-June were from the Teshekpuk Herd. The use of telemetry data alone underestimated the importance of this area for calving, but the combination of aerial surveys and telemetry data provided complementary information on caribou use of this area showing the importance of collecting the appropriate types of data for assessing potential impacts of development on caribou.


Sign in / Sign up

Export Citation Format

Share Document