Studying Crime and Place with the Crime Open Database

Author(s):  
M. P. J. Ashby

The study of spatial and temporal crime patterns is important both for academic understanding of crime-generating processes and for policies aimed at reducing crime. However, studying crime and place is often made more difficult by restrictions on access to appropriate crime data. As a result, understanding of many spatio-temporal crime patterns is limited to data from a single geographic setting, and there are few attempts at replication. This article introduces the Crime Open Database (CODE), a database of 16 million offenses from 10 of the largest United States cities, covering 11 years and more than 60 offense types. Open crime data were obtained from each city, having been published in multiple incompatible formats. The data were processed to harmonize geographic co-ordinates, dates and times, offense categories and location types, and to add census and other geographic identifiers. The resulting database allows the wider study of spatio-temporal patterns of crime across multiple US cities, allowing greater understanding of variations in the relationships between crime and place across different settings and facilitating replication of research.
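The harmonization step described in the abstract can be illustrated with a minimal sketch. It assumes two hypothetical cities that publish the same kind of record under different field names, date formats and offense labels; every name in it (`CITY_FORMATS`, `OFFENSE_MAP`, `harmonize`, the field names and labels) is invented for illustration and is not taken from the CODE processing scripts.

```python
from datetime import datetime

# Hypothetical per-city layouts. Field names and date formats are assumptions
# made for this sketch, not the formats actually published by any city.
CITY_FORMATS = {
    "city_a": {"date": ("occurred", "%m/%d/%Y %H:%M"),
               "lat": "Latitude", "lon": "Longitude", "type": "Offense"},
    "city_b": {"date": ("date_time", "%Y-%m-%dT%H:%M:%S"),
               "lat": "y", "lon": "x", "type": "category"},
}

# Hypothetical mapping from local offense labels to shared categories.
OFFENSE_MAP = {
    "BURG - RESIDENTIAL": "burglary",
    "Burglary": "burglary",
    "THEFT FROM AUTO": "theft",
    "Larceny": "theft",
}


def harmonize(city, row):
    """Convert one city-specific record into a common schema."""
    fmt = CITY_FORMATS[city]
    date_field, date_pattern = fmt["date"]
    return {
        "city": city,
        # Parse the local date format and emit ISO 8601 text.
        "date_time": datetime.strptime(row[date_field], date_pattern).isoformat(),
        "latitude": float(row[fmt["lat"]]),
        "longitude": float(row[fmt["lon"]]),
        # Labels missing from the mapping fall back to a catch-all category.
        "offense": OFFENSE_MAP.get(row[fmt["type"]], "other"),
    }


record_a = harmonize("city_a", {"occurred": "03/14/2015 21:30", "Latitude": "41.88",
                                "Longitude": "-87.63", "Offense": "BURG - RESIDENTIAL"})
record_b = harmonize("city_b", {"date_time": "2015-03-14T21:30:00", "y": "40.71",
                                "x": "-74.00", "category": "Larceny"})
```

Once both records share the same keys and value formats, they can be stacked into a single table and compared across cities. A real pipeline would also need to handle co-ordinate reference systems, time zones and census identifiers, which this sketch leaves out.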

2018 ◽  
Author(s):  
Matthew P J Ashby



2018 ◽  
Author(s):  
Prathyush Sambaturu ◽  
Parantapa Bhattacharya ◽  
Jiangzhuo Chen ◽  
Bryan Lewis ◽  
Madhav Marathe ◽  
...  

BACKGROUND: Agencies such as the Centers for Disease Control and Prevention (CDC) currently release influenza-like illness incidence data, along with descriptive summaries of simple spatio-temporal patterns and trends. However, public health researchers, government agencies, as well as the general public, are often interested in deeper patterns and insights into how the disease is spreading, with additional context. Analysis by domain experts is needed for deriving such insights from incidence data. OBJECTIVE: Our goal was to develop an automated approach for finding interesting spatio-temporal patterns in the spread of a disease over a large region, such as regions which have specific characteristics (eg, high incidence in a particular week, those which showed a sudden change in incidence) or regions which have significantly different incidence compared to earlier seasons. METHODS: We developed techniques from the area of transactional data mining for characterizing and finding interesting spatio-temporal patterns in disease spread in an automated manner. A key part of our approach involved using the principle of minimum description length for representing a given target set in terms of combinations of attributes (referred to as clauses); we considered both positive and negative clauses, relaxed descriptions which approximately represent the set, and used integer programming to find such descriptions. Finally, we designed an automated approach, which examines a large space of sets corresponding to different spatio-temporal patterns, and ranks them based on the ratio of their size to their description length (referred to as the compression ratio). RESULTS: We applied our methods using minimum description length to find spatio-temporal patterns in the spread of seasonal influenza in the United States using state level influenza-like illness activity indicator data from the CDC. We observed that the compression ratios were over 2.5 for 50% of the chosen sets, when approximate descriptions and negative clauses were allowed. Sets with high compression ratios (eg, over 2.5) corresponded to interesting patterns in the spatio-temporal dynamics of influenza-like illness. Our approach also outperformed description by solution in terms of the compression ratio. CONCLUSIONS: Our approach, which is an unsupervised machine learning method, can provide new insights into patterns and trends in the disease spread in an automated manner. Our results show that the description complexity is an effective approach for characterizing sets of interest, which can be easily extended to other diseases and regions beyond influenza in the US. Our approach can also be easily adapted for automated generation of narratives.
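The compression ratio mentioned in the abstract (set size divided by description length) can be illustrated with a small sketch. It rests on assumptions that the abstract does not confirm: a clause is modeled as a dictionary of attribute-value pairs, description length is counted simply as the number of clauses, and the state attributes are made up. It only evaluates a description that is already given; it does not perform the integer-programming search the authors describe.

```python
def covered(items, clause):
    """Names of items whose attributes match every pair in the clause."""
    return {name for name, attrs in items.items()
            if all(attrs.get(key) == value for key, value in clause.items())}


def described_set(items, positive, negative=()):
    """Union of the positive clauses minus the union of the negative clauses."""
    included = set().union(*(covered(items, c) for c in positive))
    excluded = set().union(*(covered(items, c) for c in negative))
    return included - excluded


def compression_ratio(target, positive, negative=()):
    """Set size over description length (assumed here to be the clause count)."""
    return len(target) / (len(positive) + len(negative))


# Made-up attributes for illustration only, not CDC data.
states = {
    "GA": {"region": "south", "activity": "high"},
    "FL": {"region": "south", "activity": "high"},
    "AL": {"region": "south", "activity": "high"},
    "TX": {"region": "south", "activity": "low"},
    "NY": {"region": "northeast", "activity": "high"},
}

target = {"GA", "FL", "AL"}
positive = [{"region": "south"}]   # covers GA, FL, AL and TX
negative = [{"activity": "low"}]   # removes TX

exact = described_set(states, positive, negative) == target
ratio = compression_ratio(target, positive, negative)  # 3 items / 2 clauses
```

Here three states are described exactly by two clauses, giving a ratio of 1.5. Under this simplified counting, a set that needs one clause per member has a ratio of 1, so higher ratios indicate sets that the attributes summarize compactly.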


2022 ◽  
Vol 2022 ◽  
pp. 1-18
Author(s):  
Xiaohua Luo ◽  
Jiaruo Peng ◽  
Mingsong Mao

Many studies show that criminal activity exhibits temporal and spatial regularities. However, they often focus on specific cities or specific types of crime and cannot clearly explain the patterns of crime. What are the temporal patterns at the microlevel spatial scale, and how general are they? Understanding the regularities of urban crime is important because it can help improve the economy and safety of cities and maintain harmony. This study analyzes theft and burglary data from five cities in the United States. We find spatiotemporal patterns for the two types of crime in different time series across cities.


2007 ◽  
Vol 14 (1) ◽  
pp. 83-100 ◽  
Author(s):  
Lance A. Waller ◽  
Brett J. Goodwin ◽  
Mark L. Wilson ◽  
Richard S. Ostfeld ◽  
Stacie L. Marshall ◽  
...  

10.2196/12842 ◽  
2020 ◽  
Vol 6 (3) ◽  
pp. e12842
Author(s):  
Prathyush Sambaturu ◽  
Parantapa Bhattacharya ◽  
Jiangzhuo Chen ◽  
Bryan Lewis ◽  
Madhav Marathe ◽  
...  



2019 ◽  
Vol 264 ◽  
pp. 40-55 ◽  
Author(s):  
Marina Peña-Gallardo ◽  
Sergio M. Vicente-Serrano ◽  
Steven Quiring ◽  
Marc Svoboda ◽  
Jamie Hannaford ◽  
...  

2017 ◽  
Vol 5 (7) ◽  
pp. 771-788 ◽  
Author(s):  
A. Sankarasubramanian ◽  
J. L. Sabo ◽  
K. L. Larson ◽  
S. B. Seo ◽  
T. Sinha ◽  
...  
