Detecting Spammers via Aggregated Historical Data Set

Using a historical data set and recent advances in non-parametric time series modelling, we investigate the nexus between tourism flows and house prices in Germany over nearly 150 years. We use time-varying non-parametric techniques given that historical data tend to exhibit abrupt changes and other forms of non-linearities. Our findings show evidence of a time-varying effect of tourism flows on house prices, although with mixed effects. The pre-World War II time-varying estimates of tourism show both positive and negative effects on house prices. While changes in tourism flows contribute to increasing housing prices over the post-1950 period, this is short-lived, and the effect declines until the mid-1990s. However, we find a positive and significant relationship after 2000, where the impact of tourism on house prices becomes more pronounced in recent years.

Download Full-text

Modern analyses on an historical data set: skull morphology of Italian red squirrel populations

ZooKeys ◽

10.3897/zookeys.368.4691 ◽

2014 ◽

Vol 368 ◽

pp. 79-89 ◽

Cited By ~ 4

Author(s):

Giovanni Amori ◽

Gaetano Aloise ◽

Luca Luiselli

Keyword(s):

Historical Data ◽

Red Squirrel ◽

Skull Morphology ◽

Data Set

Download Full-text

Summarising salient information on historical controls: A structured assessment of validity and comparability across studies

Clinical Trials ◽

10.1177/1740774520944855 ◽

2020 ◽

Vol 17 (6) ◽

pp. 607-616

Author(s):

Anthony Hatswell ◽

Nick Freemantle ◽

Gianluca Baio ◽

Emmanuel Lesaffre ◽

Joost van Rosmalen

Keyword(s):

Colorectal Cancer ◽

Historical Data ◽

Outcome Measurement ◽

Disease Process ◽

Patient Characteristics ◽

Study Patient ◽

Data Set ◽

Cancer Data ◽

Statistical Approaches ◽

Randomised Controlled

Background While placebo-controlled randomised controlled trials remain the standard way to evaluate drugs for efficacy, historical data are used extensively across the development cycle. This ranges from supplementing contemporary data to increase the power of trials to cross-trial comparisons in estimating comparative efficacy. In many cases, these approaches are performed without in-depth review of the context of data, which may lead to bias and incorrect conclusions. Methods We discuss the original ‘Pocock’ criteria for the use of historical data and how the use of historical data has evolved over time. Based on these factors and personal experience, we created a series of questions that may be asked of historical data, prior to their use. Based on the answers to these questions, various statistical approaches are recommended. The strategy is illustrated with a case study in colorectal cancer. Results A number of areas need to be considered with historical data, which we split into three categories: outcome measurement, study/patient characteristics (including setting and inclusion/exclusion criteria), and disease process/intervention effects. Each of these areas may introduce issues if not appropriately handled, while some may preclude the use of historical data entirely. We present a tool (in the form of a table) for highlighting any such issues. Application of the tool to a colorectal cancer data set demonstrates under what conditions historical data could be used and what the limitations of such an analysis would be. Conclusion Historical data can be a powerful tool to augment or compare with contemporary trial data, though caution is required. We present some of the issues that may be considered when involving historical data and what (if any) statistical approaches may account for differences between studies. We recommend that, where historical data are to be used in analyses, potential differences between studies are addressed explicitly.

Download Full-text

Optimal placement of wind turbines: A Monte Carlo approach with large historical data set

2010 IEEE International Conference on Electro/Information Technology ◽

10.1109/eit.2010.5612130 ◽

2010 ◽

Cited By ~ 7

Author(s):

Priti Sood ◽

Vincent Winstead ◽

Paul Steevens

Keyword(s):

Monte Carlo ◽

Wind Turbines ◽

Historical Data ◽

Optimal Placement ◽

Data Set ◽

Monte Carlo Approach

Download Full-text

Defusing Technology:Technology Diffusion in British Columbia

International Journal of Technology Assessment in Health Care ◽

10.1017/s0266462300003020 ◽

1993 ◽

Vol 9 (1) ◽

pp. 46-61 ◽

Cited By ~ 5

Author(s):

Arminée Kazanjian ◽

Kathryn Friesen

Keyword(s):

British Columbia ◽

Historical Data ◽

Current Data ◽

Fiscal Year ◽

Data Sets ◽

Canadian Province ◽

Data Set ◽

Time Periods ◽

Institutional Profile ◽

Areas Of Interest

AbstractIn order to explore the diffusion of the selected technologies in one Canadian province (British Columbia), two administrative data sets were analyzed. The data included over 40 million payment records for each fiscal year on medical services provided to British Columbia residents (2,968,769 in 1988) and information on physical facilities, services, and personnel from 138 hospitals in the province. Three specific time periods were examined in each data set, starting with 1979–80 and ending with the most current data available at the time. The detailed retrospective analysis of laboratory and imaging technologies provides historical data in three areas of interest: (a) patterns of diffusion and volume of utilization, (b) institutional profile, and (c) provider profile. The framework for the analysis focused, where possible, on the examination of determinants of diffusion that may be amenable to policy influence.

Download Full-text

Autarky and the Rise and Fall of Piracy in Ming China

The Journal of Economic History ◽

10.1017/s0022050714000345 ◽

2014 ◽

Vol 74 (2) ◽

pp. 509-534 ◽

Cited By ~ 5

Author(s):

James Kai-sing Kung ◽

Chicheng Ma

Keyword(s):

Historical Data ◽

Sharp Rise ◽

Data Set ◽

Trade Potential ◽

The Impact

We examine the impact of rigorous trade suppression during 1550–1567 on the sharp rise of piracy in this period of Ming China. By analyzing a uniquely constructed historical data set, we find that the enforcement of a “sea (trade) ban” policy led to a rise in pirate attacks that was 1.3 times greater among the coastal prefectures more suitable for silk manufactures—our proxy for greater trade potential. Our study illuminates the conflicts in which China subsequently engaged with the Western powers, conflicts that eventually resulted in the forced abandonment of its long upheld autarkic principle.

Download Full-text

Political Institutions and Regimes since 1600: A New Historical Data Set

Journal of Interdisciplinary History ◽

10.1162/jinh_a_01052 ◽

2017 ◽

Vol 47 (4) ◽

pp. 495-520 ◽

Cited By ~ 4

Author(s):

Max Rånge ◽

Mikael Sandberg

Keyword(s):

Political Institutions ◽

Historical Data ◽

Political Regimes ◽

Data Set ◽

Monthly Data ◽

Research Instrument ◽

Nondemocratic Regimes ◽

Yearly Data

A new data set provides vital information about the world’s political institutions, from 1789 on a monthly and yearly basis and from 1600 on a yearly basis. The yearly data set from 1600 has more than 90,000 country–year observations, and the monthly data set from 1789 more than 600,000 observations—by far the most comprehensive to date, offering several advantages over other available ones. The data set aggregates specific attributes to create nominal and ordinal rankings of political regimes on a scale of 1 to 1,000. In addition to supporting a rigorous classification of democratic and nondemocratic regimes, it allows researchers to trace institutional variations and to explore alternative ways of aggregating political institutions. As a research instrument, the MaxRange data set permits historically minded scholars to address a number of issues related to the dynamics of political institutions in an unprecedented manner.

Download Full-text

Malthus in the Bedroom: Birth Spacing as Birth Control in Pre-Transition England

Demography ◽

10.1007/s13524-017-0556-4 ◽

2017 ◽

Vol 54 (2) ◽

pp. 413-436 ◽

Cited By ~ 26

Author(s):

Francesco Cinnirella ◽

Marc Klemp ◽

Jacob Weisdorf

Keyword(s):

Birth Control ◽

Historical Data ◽

Birth Spacing ◽

Duration Models ◽

Economic Conditions ◽

Data Set ◽

Dependent Children ◽

Lower Socioeconomic

Abstract We use duration models on a well-known historical data set of more than 15,000 families and 60,000 births in England for the period 1540–1850 to show that the sampled families adjusted the timing of their births in accordance with the economic conditions as well as their stock of dependent children. The effects were larger among the lower socioeconomic ranks. Our findings on the existence of parity-dependent as well as parity-independent birth spacing in England are consistent with the growing evidence that marital birth control was present in pre-transitional populations.

Download Full-text

Trends in erythemal doses at the Polish Polar Station, Hornsund, Svalbard based on the homogenized measurements (1996–2016) and reconstructed data (1983–1995)

Atmospheric Chemistry and Physics ◽

10.5194/acp-18-1-2018 ◽

2018 ◽

Vol 18 (1) ◽

pp. 1-11 ◽

Cited By ~ 9

Author(s):

Janusz W. Krzyścin ◽

Piotr S. Sobolewski

Keyword(s):

Historical Data ◽

Sunshine Duration ◽

Satellite Measurements ◽

Data Set ◽

Modification Factor ◽

Daily Sunshine ◽

Yearly Dose ◽

Column Ozone ◽

Dose Variability

Abstract. Erythemal daily doses measured at the Polish Polar Station, Hornsund (77°00′ N, 15°33′ E), for the periods 1996–2001 and 2005–2016 are homogenized using yearly calibration constants derived from the comparison of observed doses for cloudless conditions with the corresponding doses calculated by radiative transfer (RT) simulations. Modeled all-sky doses are calculated by the multiplication of cloudless RT doses by the empirical cloud modification factor dependent on the daily sunshine duration. An all-sky model is built using daily erythemal doses measured in the period 2005–2006–2007. The model is verified by comparisons with the 1996–1997–1998 and 2009–2010–2011 measured data. The daily doses since 1983 (beginning of the proxy data) are reconstructed using the all-sky model with the historical data of the column ozone from satellite measurements (SBUV merged ozone data set), the snow depth (for ground albedo estimation), and the observed daily sunshine duration at the site. Trend analyses of the monthly and yearly time series comprised of the reconstructed and observed doses do not reveal a statistically significant trend in the period 1983–2016. The trends based on the observed data only (1996–2001 and 2005–2016) show declining tendency (about −1 % per year) in the monthly mean of daily erythemal doses in May and June, and in the yearly sum of daily erythemal doses. An analysis of sources of the yearly dose variability since 1983 shows that cloud cover changes are a basic driver of the long-term UV changes at the site.

Download Full-text

SYNTHESIS OP HURRICANE RESPONSE HYDROGRAPHS

Coastal Engineering Proceedings ◽

10.9753/icce.v18.7 ◽

1982 ◽

Vol 1 (18) ◽

pp. 7

Author(s):

Rodney J. Sobey

Keyword(s):

Water Level ◽

Historical Data ◽

Breaking Wave ◽

Total Water ◽

Coastal Site ◽

Data Set ◽

Storm Tide ◽

Synthesis Technique ◽

Astronomical Tide ◽

Total Water Level

A hindcasting methodology is described for the total water level and wave hydrographs at a coastal site during a hurricane. It accommodates phasing of the separate components of the sustained water level (astronomical tide, storm tide, breaking wave setup) , as well as storm variability and coastal bathymetry. Complete hindcast models are utilised, but an intermediate cost and precision is achieved by compromising the number of complete hindcast storms, rather than the precision of the hindcast model. A synthesis technique is developed to predict the response hydrographs of the remaining storms in the historical data set.

Download Full-text