scholarly journals Detecting Spammers via Aggregated Historical Data Set

Author(s):  
Eitan Menahem ◽  
Rami Pusiz ◽  
Yuval Elovici
Keyword(s):  
2021 ◽  
pp. 135481662110088
Author(s):  
Sefa Awaworyi Churchill ◽  
John Inekwe ◽  
Kris Ivanovski

Using a historical data set and recent advances in non-parametric time series modelling, we investigate the nexus between tourism flows and house prices in Germany over nearly 150 years. We use time-varying non-parametric techniques given that historical data tend to exhibit abrupt changes and other forms of non-linearities. Our findings show evidence of a time-varying effect of tourism flows on house prices, although with mixed effects. The pre-World War II time-varying estimates of tourism show both positive and negative effects on house prices. While changes in tourism flows contribute to increasing housing prices over the post-1950 period, this is short-lived, and the effect declines until the mid-1990s. However, we find a positive and significant relationship after 2000, where the impact of tourism on house prices becomes more pronounced in recent years.


ZooKeys ◽  
2014 ◽  
Vol 368 ◽  
pp. 79-89 ◽  
Author(s):  
Giovanni Amori ◽  
Gaetano Aloise ◽  
Luca Luiselli

2020 ◽  
Vol 17 (6) ◽  
pp. 607-616
Author(s):  
Anthony Hatswell ◽  
Nick Freemantle ◽  
Gianluca Baio ◽  
Emmanuel Lesaffre ◽  
Joost van Rosmalen

Background While placebo-controlled randomised controlled trials remain the standard way to evaluate drugs for efficacy, historical data are used extensively across the development cycle. This ranges from supplementing contemporary data to increase the power of trials to cross-trial comparisons in estimating comparative efficacy. In many cases, these approaches are performed without in-depth review of the context of data, which may lead to bias and incorrect conclusions. Methods We discuss the original ‘Pocock’ criteria for the use of historical data and how the use of historical data has evolved over time. Based on these factors and personal experience, we created a series of questions that may be asked of historical data, prior to their use. Based on the answers to these questions, various statistical approaches are recommended. The strategy is illustrated with a case study in colorectal cancer. Results A number of areas need to be considered with historical data, which we split into three categories: outcome measurement, study/patient characteristics (including setting and inclusion/exclusion criteria), and disease process/intervention effects. Each of these areas may introduce issues if not appropriately handled, while some may preclude the use of historical data entirely. We present a tool (in the form of a table) for highlighting any such issues. Application of the tool to a colorectal cancer data set demonstrates under what conditions historical data could be used and what the limitations of such an analysis would be. Conclusion Historical data can be a powerful tool to augment or compare with contemporary trial data, though caution is required. We present some of the issues that may be considered when involving historical data and what (if any) statistical approaches may account for differences between studies. We recommend that, where historical data are to be used in analyses, potential differences between studies are addressed explicitly.


Author(s):  
Arminée Kazanjian ◽  
Kathryn Friesen

AbstractIn order to explore the diffusion of the selected technologies in one Canadian province (British Columbia), two administrative data sets were analyzed. The data included over 40 million payment records for each fiscal year on medical services provided to British Columbia residents (2,968,769 in 1988) and information on physical facilities, services, and personnel from 138 hospitals in the province. Three specific time periods were examined in each data set, starting with 1979–80 and ending with the most current data available at the time. The detailed retrospective analysis of laboratory and imaging technologies provides historical data in three areas of interest: (a) patterns of diffusion and volume of utilization, (b) institutional profile, and (c) provider profile. The framework for the analysis focused, where possible, on the examination of determinants of diffusion that may be amenable to policy influence.


2014 ◽  
Vol 74 (2) ◽  
pp. 509-534 ◽  
Author(s):  
James Kai-sing Kung ◽  
Chicheng Ma

We examine the impact of rigorous trade suppression during 1550–1567 on the sharp rise of piracy in this period of Ming China. By analyzing a uniquely constructed historical data set, we find that the enforcement of a “sea (trade) ban” policy led to a rise in pirate attacks that was 1.3 times greater among the coastal prefectures more suitable for silk manufactures—our proxy for greater trade potential. Our study illuminates the conflicts in which China subsequently engaged with the Western powers, conflicts that eventually resulted in the forced abandonment of its long upheld autarkic principle.


2017 ◽  
Vol 47 (4) ◽  
pp. 495-520 ◽  
Author(s):  
Max Rånge ◽  
Mikael Sandberg

A new data set provides vital information about the world’s political institutions, from 1789 on a monthly and yearly basis and from 1600 on a yearly basis. The yearly data set from 1600 has more than 90,000 country–year observations, and the monthly data set from 1789 more than 600,000 observations—by far the most comprehensive to date, offering several advantages over other available ones. The data set aggregates specific attributes to create nominal and ordinal rankings of political regimes on a scale of 1 to 1,000. In addition to supporting a rigorous classification of democratic and nondemocratic regimes, it allows researchers to trace institutional variations and to explore alternative ways of aggregating political institutions. As a research instrument, the MaxRange data set permits historically minded scholars to address a number of issues related to the dynamics of political institutions in an unprecedented manner.


Demography ◽  
2017 ◽  
Vol 54 (2) ◽  
pp. 413-436 ◽  
Author(s):  
Francesco Cinnirella ◽  
Marc Klemp ◽  
Jacob Weisdorf

Abstract We use duration models on a well-known historical data set of more than 15,000 families and 60,000 births in England for the period 1540–1850 to show that the sampled families adjusted the timing of their births in accordance with the economic conditions as well as their stock of dependent children. The effects were larger among the lower socioeconomic ranks. Our findings on the existence of parity-dependent as well as parity-independent birth spacing in England are consistent with the growing evidence that marital birth control was present in pre-transitional populations.


2018 ◽  
Vol 18 (1) ◽  
pp. 1-11 ◽  
Author(s):  
Janusz W. Krzyścin ◽  
Piotr S. Sobolewski

Abstract. Erythemal daily doses measured at the Polish Polar Station, Hornsund (77°00′ N, 15°33′ E), for the periods 1996–2001 and 2005–2016 are homogenized using yearly calibration constants derived from the comparison of observed doses for cloudless conditions with the corresponding doses calculated by radiative transfer (RT) simulations. Modeled all-sky doses are calculated by the multiplication of cloudless RT doses by the empirical cloud modification factor dependent on the daily sunshine duration. An all-sky model is built using daily erythemal doses measured in the period 2005–2006–2007. The model is verified by comparisons with the 1996–1997–1998 and 2009–2010–2011 measured data. The daily doses since 1983 (beginning of the proxy data) are reconstructed using the all-sky model with the historical data of the column ozone from satellite measurements (SBUV merged ozone data set), the snow depth (for ground albedo estimation), and the observed daily sunshine duration at the site. Trend analyses of the monthly and yearly time series comprised of the reconstructed and observed doses do not reveal a statistically significant trend in the period 1983–2016. The trends based on the observed data only (1996–2001 and 2005–2016) show declining tendency (about −1 % per year) in the monthly mean of daily erythemal doses in May and June, and in the yearly sum of daily erythemal doses. An analysis of sources of the yearly dose variability since 1983 shows that cloud cover changes are a basic driver of the long-term UV changes at the site.


1982 ◽  
Vol 1 (18) ◽  
pp. 7
Author(s):  
Rodney J. Sobey

A hindcasting methodology is described for the total water level and wave hydrographs at a coastal site during a hurricane. It accommodates phasing of the separate components of the sustained water level (astronomical tide, storm tide, breaking wave setup) , as well as storm variability and coastal bathymetry. Complete hindcast models are utilised, but an intermediate cost and precision is achieved by compromising the number of complete hindcast storms, rather than the precision of the hindcast model. A synthesis technique is developed to predict the response hydrographs of the remaining storms in the historical data set.


Sign in / Sign up

Export Citation Format

Share Document