scholarly journals Aspects of Pareto distributions

2021 ◽  
pp. 55-66
Author(s):  
Johan Fellman

Different skew models, such as the lognormal and the Pareto functions, have been proposed as suitable descriptions of income distribution. Specific distributions are usually applied in empirical investigations. It is a common opinion that the Pareto curve often provides an adequate description of higher incomes. Recently, double Pareto distributions that obey the power law in both the upper and lower tails have been suggested to reflect a general distribution of personal income. In this study, the literature concerning double Pareto models is presented and the model is applied to Finnish income data. JEL classification numbers: I32. Keywords: Maximum likelihood estimate, Method of moments, Bayesian method, Mean Squared Error, Lognormal, double Pareto, Coefficient of determination, survival function, Geometric Brownian motion.

2021 ◽  
Vol 149 ◽  
Author(s):  
Junwen Tao ◽  
Yue Ma ◽  
Xuefei Zhuang ◽  
Qiang Lv ◽  
Yaqiong Liu ◽  
...  

Abstract This study proposed a novel ensemble analysis strategy to improve hand, foot and mouth disease (HFMD) prediction by integrating environmental data. The approach began by establishing a vector autoregressive model (VAR). Then, a dynamic Bayesian networks (DBN) model was used for variable selection of environmental factors. Finally, a VAR model with constraints (CVAR) was established for predicting the incidence of HFMD in Chengdu city from 2011 to 2017. DBN showed that temperature was related to HFMD at lags 1 and 2. Humidity, wind speed, sunshine, PM10, SO2 and NO2 were related to HFMD at lag 2. Compared with the autoregressive integrated moving average model with external variables (ARIMAX), the CVAR model had a higher coefficient of determination (R2, average difference: + 2.11%; t = 6.2051, P = 0.0003 < 0.05), a lower root mean-squared error (−24.88%; t = −5.2898, P = 0.0007 < 0.05) and a lower mean absolute percentage error (−16.69%; t = −4.3647, P = 0.0024 < 0.05). The accuracy of predicting the time-series shape was 88.16% for the CVAR model and 86.41% for ARIMAX. The CVAR model performed better in terms of variable selection, model interpretation and prediction. Therefore, it could be used by health authorities to identify potential HFMD outbreaks and develop disease control measures.


2021 ◽  
Vol 13 (3) ◽  
pp. 438
Author(s):  
Subrina Tahsin ◽  
Stephen C. Medeiros ◽  
Arvind Singh

Long-term monthly coastal wetland vegetation monitoring is the key to quantifying the effects of natural and anthropogenic events, such as severe storms, as well as assessing restoration efforts. Remote sensing data products such as Normalized Difference Vegetation Index (NDVI), alongside emerging data analysis techniques, have enabled broader investigations into their dynamics at monthly to decadal time scales. However, NDVI data suffer from cloud contamination making periods within the time series sparse and often unusable during meteorologically active seasons. This paper proposes a virtual constellation for NDVI consisting of the red and near-infrared bands of Landsat 8 Operational Land Imager, Sentinel-2A Multi-Spectral Instrument, and Advanced Spaceborne Thermal Emission and Reflection Radiometer. The virtual constellation uses time-space-spectrum relationships from 2014 to 2018 and a random forest to produce synthetic NDVI imagery rectified to Landsat 8 format. Over the sample coverage area near Apalachicola, Florida, USA, the synthetic NDVI showed good visual coherence with observed Landsat 8 NDVI. Comparisons between the synthetic and observed NDVI showed Root Mean Squared Error and Coefficient of Determination (R2) values of 0.0020 sr−1 and 0.88, respectively. The results suggest that the virtual constellation was able to mitigate NDVI data loss due to clouds and may have the potential to do the same for other data. The ability to participate in a virtual constellation for a useful end product such as NDVI adds value to existing satellite missions and provides economic justification for future projects.


2021 ◽  
Vol 13 (7) ◽  
pp. 3727
Author(s):  
Fatema Rahimi ◽  
Abolghasem Sadeghi-Niaraki ◽  
Mostafa Ghodousi ◽  
Soo-Mi Choi

During dangerous circumstances, knowledge about population distribution is essential for urban infrastructure architecture, policy-making, and urban planning with the best Spatial-temporal resolution. The spatial-temporal modeling of the population distribution of the case study was investigated in the present study. In this regard, the number of generated trips and absorbed trips using the taxis pick-up and drop-off location data was calculated first, and the census population was then allocated to each neighborhood. Finally, the Spatial-temporal distribution of the population was calculated using the developed model. In order to evaluate the model, a regression analysis between the census population and the predicted population for the time period between 21:00 to 23:00 was used. Based on the calculation of the number of generated and the absorbed trips, it showed a different spatial distribution for different hours in one day. The spatial pattern of the population distribution during the day was different from the population distribution during the night. The coefficient of determination of the regression analysis for the model (R2) was 0.9998, and the mean squared error was 10.78. The regression analysis showed that the model works well for the nighttime population at the neighborhood level, so the proposed model will be suitable for the day time population.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Daisuke Miyamori ◽  
Takeshi Uemura ◽  
Wenliang Zhu ◽  
Kei Fujikawa ◽  
Takaaki Nakaya ◽  
...  

AbstractThe recent increase of the number of unidentified cadavers has become a serious problem throughout the world. As a simple and objective method for age estimation, we attempted to utilize Raman spectrometry for forensic identification. Raman spectroscopy is an optical-based vibrational spectroscopic technique that provides detailed information regarding a sample’s molecular composition and structures. Building upon our previous proof-of-concept study, we measured the Raman spectra of abdominal skin samples from 132 autopsy cases and the protein-folding intensity ratio, RPF, defined as the ratio between the Raman signals from a random coil an α-helix. There was a strong negative correlation between age and RPF with a Pearson correlation coefficient of r = 0.878. Four models, based on linear (RPF), squared (RPF2), sex, and RPF by sex interaction terms, were examined. The results of cross validation suggested that the second model including linear and squared terms was the best model with the lowest root mean squared error (11.3 years of age) and the highest coefficient of determination (0.743). Our results indicate that the there was a high correlation between the age and RPF and the Raman biological clock of protein folding can be used as a simple and objective forensic age estimation method for unidentified cadavers.


2017 ◽  
Vol 4 (02) ◽  
pp. 274-293
Author(s):  
Nur Hidayati ◽  
J.M.V. Mulyadi

ABSTRACT The purpose of this study is to examine whether variables such as quality of goods/services procurement committee, income of goods/services procurement committee, procurement system and procurement system, procurement ethic of goods/services and internal control system have influence to fraud of goods/ services procurement in the ministry of health affairs agency. Population in this research is all auditor related in process of procurement of goods/services, while the object of research (sample) that is as much as 56 people. The technique of determining the sample using purposive sampling method. Data were tested using validity test, reliability test, multicolinearity test, heteroskedasticity test, multiple regression analysis, hypothesis test and coefficient of determination. The result of the research shows that the quality of procurement committee variables significantly and negatively affect the fraud of procurement of goods/services. The income of the procurement committee does not significantly affect the procurement of goods/services, procurement system and procedures have significant effect and negative to the goods/service procurement, ethics have significant effect and negative to the procurement of goods/services and internal control system significantly and negative to the fraud of procurement of goods/services. ABSTRAK Tujuan dari penelitian ini adalah untuk menguji apakah variabel seperti kualitas panitia pengadaan barang/jasa, penghasilan panitia pengadaan barang/jasa, sistem dan prosedur pengadaan barang/jasa, etika pengadaan barang/jasa, dan sistem pengendalian internal memiliki pengaruh terhadap fraud pengadaan barang/jasa di Lingkungan Instansi Kementerian Kesehatan RI. Populasi dalam penelitian ini adalah seluruh auditor yang terkait dalam proses pengadaan barang/jasa, sedangkan yang dijadikan objek penelitian (sampel) yaitu sebanyak 56 orang. Teknik penentuan sampel menggunakan metode purposive sampling. Data diuji menggunakan uji validitas, uji reliabilitas, uji multikolinearitas, uji heteroskedastisitas, analisis regresi berganda, uji hipotesis dan koefisien determinasi. Hasil penelitian menunjukkan bahwa variabel kualitas panitia pengadaan berpengaruh secara signifikan dan negatif terhadap fraud pengadaan barang/jasa. Penghasilan panitia pengadaan tidak berpengaruh secara signifikan terhadap terhadap fraud pengadaan barang/jasa, sistem dan prosedur pengadaan berpengaruh secara signifikan dan negatif terhadap fraud pengadaan barang/jasa, ketika berpengaruh secara signifikan dan negatif terhadap fraud pengadaan barang/jasa dan sistem pengendalian internal berpengaruh secara signifikan dan negatif terhadap fraud pengadaan barang/jasa. JEL Classification: M41, M42, H57


2018 ◽  
Vol 6 (1-2) ◽  
pp. 50-65 ◽  
Author(s):  
Rittwik Chatterjee ◽  
Srobonti Chattopadhyay ◽  
Tarun Kabiraj

Spillovers of R&D outcome affect the R&D decision of a firm. The present paper discusses the R&D incentives of a firm when the extent of R&D spillover is private information to each firm. We construct a two-stage game involving two firms when the firms first decide simultaneously whether to invest in R&D or not, then they compete in quantity. Assuming general distribution function of firm types we compare R&D incentives of firms under alternative scenarios based on different informational structures. The paper shows that while R&D spillovers reduce R&D incentives under complete information unambiguously, however, it can be larger under incomplete information. JEL Classification: D43, D82, L13, O31


2021 ◽  
Vol 2021 ◽  
pp. 1-14
Author(s):  
Samuel Osah ◽  
Akwasi A. Acheampong ◽  
Collins Fosu ◽  
Isaac Dadzie

The growing demand for Global Navigation Satellite System (GNSS) technology has necessitated the establishment of a vast and ever-growing network of International GNSS Service (IGS) tracking stations worldwide. The IGS provides highly accurate and highly reliable daily time-series Zenith Tropospheric Delay (ZTD) products using data from the member sites towards the use of GNSS for precise geodetic, climatological, and meteorological applications. However, if for reasons like poor internet connectivity, equipment failure, and power outages, the IGS station is inaccessible or malfunctioning, and gaps are created in the data archive resulting in degrading the quality of the ZTD and precipitable water vapour (PWV) estimation. To address this challenge as a means of providing an alternative data source to improve the continuous availability of ZTD data and as a backup data in the event that the IGS site data are missing or unavailable in West Africa, this paper compares the sitewise operational Vienna Mapping Functions 3 (VMF3) ZTD product with the IGS final ZTD product over five IGS stations in West Africa. Eight different statistical evaluation metrics, such as the mean bias (MB), mean absolute error (MAE), root mean squared error (RMSE), Pearson correlation coefficient (r), coefficient of determination (r2), refined index of agreement (IAr), Nash–Sutcliffe coefficient of efficiency (NSE), and the fraction of prediction within a factor of two (FAC2), are employed to determine the degree of agreement between the VMF3 and IGS tropospheric products. The results show that the VMF3-ZTD product performed excellently and matches very well with the IGS final ZTD product with an average MB, MAE, RMSE, r, r2, NSE, IAr, and FAC2 of 0.38 cm, 0.87 cm, 1.11 cm, 0.988, 0.976, 0.967, 0.992, and 1.00 (100%), respectively. This result is an indication that the VMF3-ZTD product is accurate enough to be used as an alternative source of ZTD data to augment the IGS final ZTD product for positioning and meteorological applications in West Africa.


2018 ◽  
Vol 80 (01) ◽  
pp. 072-078 ◽  
Author(s):  
Berdine Heesterman ◽  
John-Melle Bokhorst ◽  
Lisa de Pont ◽  
Berit Verbist ◽  
Jean-Pierre Bayley ◽  
...  

Background To improve our understanding of the natural course of head and neck paragangliomas (HNPGL) and ultimately differentiate between cases that benefit from early treatment and those that are best left untreated, we studied the growth dynamics of 77 HNPGL managed with primary observation. Methods Using digitally available magnetic resonance images, tumor volume was estimated at three time points. Subsequently, nonlinear least squares regression was used to fit seven mathematical models to the observed growth data. Goodness of fit was assessed with the coefficient of determination (R 2) and root-mean-squared error. The models were compared with Kruskal–Wallis one-way analysis of variance and subsequent post-hoc tests. In addition, the credibility of predictions (age at onset of neoplastic growth and estimated volume at age 90) was evaluated. Results Equations generating sigmoidal-shaped growth curves (Gompertz, logistic, Spratt and Bertalanffy) provided a good fit (median R 2: 0.996–1.00) and better described the observed data compared with the linear, exponential, and Mendelsohn equations (p < 0.001). Although there was no statistically significant difference between the sigmoidal-shaped growth curves regarding the goodness of fit, a realistic age at onset and estimated volume at age 90 were most often predicted by the Bertalanffy model. Conclusions Growth of HNPGL is best described by decelerating tumor growth laws, with a preference for the Bertalanffy model. To the best of our knowledge, this is the first time that this often-neglected model has been successfully fitted to clinically obtained growth data.


2021 ◽  
Author(s):  
Hangsik Shin

BACKGROUND Arterial stiffness due to vascular aging is a major indicator for evaluating cardiovascular risk. OBJECTIVE In this study, we propose a method of estimating age by applying machine learning to photoplethysmogram for non-invasive vascular age assessment. METHODS The machine learning-based age estimation model that consists of three convolutional layers and two-layer fully connected layers, was developed using segmented photoplethysmogram by pulse from a total of 752 adults aged 19–87 years. The performance of the developed model was quantitatively evaluated using mean absolute error, root-mean-squared-error, Pearson’s correlation coefficient, coefficient of determination. The Grad-Cam was used to explain the contribution of photoplethysmogram waveform characteristic in vascular age estimation. RESULTS Mean absolute error of 8.03, root mean squared error of 9.96, 0.62 of correlation coefficient, and 0.38 of coefficient of determination were shown through 10-fold cross validation. Grad-Cam, used to determine the weight that the input signal contributes to the result, confirmed that the contribution to the age estimation of the photoplethysmogram segment was high around the systolic peak. CONCLUSIONS The machine learning-based vascular aging analysis method using the PPG waveform showed comparable or superior performance compared to previous studies without complex feature detection in evaluating vascular aging. CLINICALTRIAL 2015-0104


2021 ◽  
pp. 1-10
Author(s):  
Sandra K. Hnat ◽  
Musa L. Audu ◽  
Ronald J. Triolo ◽  
Roger D. Quinn

Estimating center of mass (COM) through sensor measurements is done to maintain walking and standing stability with exoskeletons. The authors present a method for estimating COM kinematics through an artificial neural network, which was trained by minimizing the mean squared error between COM displacements measured by a gold-standard motion capture system and recorded acceleration signals from body-mounted accelerometers. A total of 5 able-bodied participants were destabilized during standing through: (1) unexpected perturbations caused by 4 linear actuators pulling on the waist and (2) volitionally moving weighted jars on a shelf. Each movement type was averaged across all participants. The algorithm’s performance was quantified by the root mean square error and coefficient of determination (R2) calculated from both the entire trial and during each perturbation type. Throughout the trials and movement types, the average coefficient of determination was 0.83, with 89% of the movements with R2 > .70, while the average root mean square error ranged between 7.3% and 22.0%, corresponding to 0.5- and 0.94-cm error in both the coronal and sagittal planes. COM can be estimated in real time for balance control of exoskeletons for individuals with a spinal cord injury, and the procedure can be generalized for other gait studies.


Sign in / Sign up

Export Citation Format

Share Document