Geographically Stratified Importance Sampling for the Calibration of Aggregated Destination Choice Models for Trip Distribution

Author(s):  
Min-Tang Li ◽  
Lee-Fang Chow ◽  
Fang Zhao ◽  
Shi-Chiang Li

A key feature in estimating and applying destination choice models with aggregate alternatives is to sample a set of nonchosen traffic analysis zones (TAZs), plus the one a trip maker chose, to construct a destination choice set. Computational complexity is reduced because the choice set would be too large if all study area TAZs were included in the calibration. Commonly, two types of sampling strategies are applied to draw subsets of alternatives from the universal choice set. The first, and simplest, approach is to select randomly a subset of nonchosen alternatives with uniform selection probabilities and then add the chosen alternative if it is not otherwise included. The approach, however, is not an efficient sampling scheme because most alternatives for a given trip maker may have small choice probabilities. The second approach, stratified importance sampling, draws samples with unequal selection probabilities determined on the basis of preliminary estimates of choice probabilities for every alternative in the universal choice set. The stratified sampling method assigns different selection probabilities to alternatives in different strata. Simple random sampling is applied to draw alternatives in each stratum. However, it is unclear how to divide the study area so that destination TAZs may be sampled effectively. The process of and findings from implementing a stratified sampling strategy in selecting alternative TAZs for calibrating aggregate destination choice models in a geographic information system (GIS) environment are described. In this stratified sampling analysis, stratum regions varied by spatial location and employment size in the adjacent area were defined for each study area TAZ. The sampling strategy is more effective than simple random sampling in regard to maximum log likelihood and goodness-of-fit values.

2020 ◽  
Vol 20 (2) ◽  
pp. 152-167
Author(s):  
Sebastian Gnat

Abstract Research background: Mass valuation is a process in which many properties are valued simultaneously with a uniform approach. An example of a procedure used for mass real estate valuation is the Szczecin Algorithm of Real Estate Mass Appraisal (SAREMA), which can be developed into a multiple regression model. The algorithm is based on a set of drawn representative properties. This set determines, inter alia, the quality of obtained valuations. Purpose: The objective of the study is to verify the hypothesis whether changing the method of sampling representative properties from the originally used simple random sampling to stratified sampling improves the results of the SAREMA econometric variant. Research methodology: The article presents a study that uses two methods of representative properties sampling – simple random sampling and stratified sampling. Errors of the models of valuation created taking into account both methods of sampling and different number of representative properties are compared. A key aspect of the survey is the choice of a better sampling method. Results: The study has shown that stratified sampling improves valuation results and, more specifically, allows for lower root mean square errors. Stratified sampling yielded better results in the initial phase of the study with more observations, but reducing the percentage of strata participating in the draws, despite the increase in RMSE, guaranteed lower errors than the corresponding results based on simple sampling in all variants of the study. Novelty: The article confirms the possibility of improving the results of mass property valuation by changing the scheme of representative properties sampling. The results allowed for the conclusion that stratified sampling is a better way of creating a set of representative properties.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Aya A. Mitani ◽  
Nathaniel D. Mercaldo ◽  
Sebastien Haneuse ◽  
Jonathan S. Schildcrout

Abstract Background A large multi-center survey was conducted to understand patients’ perspectives on biobank study participation with particular focus on racial and ethnic minorities. In order to enrich the study sample with racial and ethnic minorities, disproportionate stratified sampling was implemented with strata defined by electronic health records (EHR) that are known to be inaccurate. We investigate the effect of sampling strata misclassification in complex survey design. Methods Under non-differential and differential misclassification in the sampling strata, we compare the validity and precision of three simple and common analysis approaches for settings in which the primary exposure is used to define the sampling strata. We also compare the precision gains/losses observed from using a disproportionate stratified sampling scheme compared to using a simple random sample under varying degrees of strata misclassification. Results Disproportionate stratified sampling can result in more efficient parameter estimates of the rare subgroups (race/ethnic minorities) in the sampling strata compared to simple random sampling. When sampling strata misclassification is non-differential with respect to the outcome, a design-agnostic analysis was preferred over model-based and design-based analyses. All methods yielded unbiased parameter estimates but standard error estimates were lowest from the design-agnostic analysis. However, when misclassification is differential, only the design-based method produced valid parameter estimates of the variables included in the sampling strata. Conclusions In complex survey design, when the interest is in making inference on rare subgroups, we recommend implementing disproportionate stratified sampling over simple random sampling even if the sampling strata are misclassified. If the misclassification is non-differential, we recommend a design-agnostic analysis. However, if the misclassification is differential, we recommend using design-based analyses.


Author(s):  
Dina Indarsita ◽  
Mariaty S ◽  
Ravina Primursanti

Latar belakang: Masa pubertas adalah terjadinya perubahan biologis yang meliputi morfologi dan fisiologi yang terjadi dengan pesat dari masa anak kemasa dewasa, terutama kapasitas reproduksi yaitu perubahan alat kelamin dari tahap anak kedewasa. berdasarkan persentase terkecil aspek fisik pada perilaku remaja mengenai keadaan fisik diperoleh 48,4%. Hal ini mengindikasikan bahwa masih banyak siswa yang memiliki pengetahuan, penilaian serta pengharapan yang belum baik tentang perubahan fisik. Hasil penelitian lain menunjukkan Remaja pada masa pubertas memiliki penerimaan yang positif terhadap perubahan fisik, yaitu sebanyak 78,63% dan penerimaan negatif terhadap perubahan fisik, yaitu sebanyak 21,37%. Tujuan penelitian : ini adalah untuk mengetahui perilaku remaja dalam hal perubahan fisiologis pada masa pubertas di SMP Yayasan Pendidikan Shafiyyatul Amaliyyah Medan Tahun 2013. Metodologi : Desain penelitian yang digunakan dalam penelitian ini adalah deskriptif dengan pendekatan cross sectional. Jumlah sampel dalam penelitian ini adalah 173 orang dengan tehnik pengambilan sampel adalah secara proporsi bertingkat (proportional stratified sampling) dan acak sederhana (simple random sampling). Penelitian ini dilakukan pada bulan April 2013. Hasil : Hasil penelitian diperoleh pengetahuan remaja berpengetahuan baik sebanyak 134 orang (77,5 %), berpengetahuan cukup sebanyak 36 orang (20,8 %), dan berpengetahuan kurang sebanyak 3 orang (1,7 %), sikap remaja mayoritas memiliki sikap positif sebanyak 162 orang (93,6 %) dan minoritas memiliki sikap negatif sebanyak 11 orang (6,4 %), tindakan remaja diperoleh tindakan baik sebanyak 157 orang ( 90,8 %) dan tindakan kurang sebanyak 16 orang ( 9,2 %). Dari hasil penelitian ini diketahui bahwa perilaku remaja awal dalam hal perubahan fisiologis di SMP Yayasan Pendidikan Shafiyyatul Amaliyyah Medan Tahun 2013 baik.


2020 ◽  
pp. 16-20
Author(s):  
Chandni Kumari ◽  
Ratan Kumar Thakur

This paper considers the problem of estimating the population mean under double sampling. We have suggested the generalized class of estimators under Lahiri (1951) to Midzuno (1952) and Sen (1952) type sampling scheme and its properties are studied up to the first order of approximation. Further, we compare the proposed sampling strategy with some conventional estimators under the simple random sampling without replacement. On the basis of suitable range information, we give some concluding remarks related to propose sampling strategy. An empirical study is given in support of the present study.


2012 ◽  
Vol 610-613 ◽  
pp. 3732-3737 ◽  
Author(s):  
Ji Ping Zhang ◽  
Lin Bo Zhang ◽  
Bin Gong

This study combines the sampling technique, geographic information system and remote sensing technique to conduct a sampling survey on forest cover area of Jinggangshan National Nature Reserve in China on the basis of TM remote sensing image. The spatial simple random sampling, spatial stratified sampling and sandwich sampling model are respectively utilized to establish the sampling design. For the spatial simple random sampling model, the spatial autocorrelation analysis method is adopted to determine the spatial autocorrelation coefficient through calculating Moran's I index, while in the spatial stratified sampling and sandwich sampling model, the yearly maximum NDVI (Normalized Difference Vegetation Index) is utilized to conduct the spatial stratification. Through comparison of the sampling accuracy of three sampling models, a higher precision and more reasonable sampling method and sampling model is provided for remote sensing monitoring of forest cover area. The study results show that: sandwich sampling model is featured as the highest sampling accuracy, followed by the spatial stratified sampling and simple random sampling. Under the requirement of same precision, sandwich spatial sampling model can reduce quantity of the sampling points, and create all kinds of report units according to demands of different spatial area, so it is featured as the better suitability.


2015 ◽  
Vol 2 (1) ◽  
pp. 39
Author(s):  
Basman Basman ◽  
Takdir Saili ◽  
La Ode Ba'a

Penelitian ini bertujuan untuk mengetahui produktivitas ternak Kambing Kacang berdasarkan nilai kid crop dan mortalitas anak Kambing Kacang baik di wilayah kepulauan maupun wilayah daratan Kabupaten Buton. Penelitian ini dilaksanakan di Kecamatan Siompu (mewakili wilayah kepulauan) dan di Kecamatan Lapandewa (mewakili wilayah daratan) Kabupaten Buton. Metode penentuan lokasi penelitian dilakukan secara purposive sampling, stratified sampling dan simple random sampling dan penentuan responden di setiap desa dilakukan secara sensus. Data penelitian dianalisis secara deskriptif. Hasil penelitian menunjukkan bahwa kid crop Kambing Kacang di Kecamatan Siompu sebesar 150,98% dan Kecamatan Lapandewa sebesar 159,84%. Kidding Interval Kambing Kacang di Kecamatan Siompu sebesar 8,2 bulan dan di Kecamatan Lapandewa sebesar 8,19 bulan. Di Kecamatan Siompu diperoleh rataan litter size sebesar 1,77 dan di Kecamatan Lapandewa sebesar 1,53. Jumlah cempe Kambing Kacang yang lahir di Kecamatan Siompu sebanyak 84 ekor (38 ekor jantan dan 46 ekor betina). Di Kecamatan Lapandewa jumlah cempe Kambing Kacang yang lahir sebanyak 68 ekor (37 ekor jantan dan 31 ekor betina). Persentase mortalitas cempe kambing Kacang di Kecamatan Siompu sebesar 22,61% dan di Kecamatan Lapandewa sebesar 11,76%. Dapat disimpulkan bahwa produktivitas dan reprodutivitas ternak Kambing Kacang baik di wilayah kepulauan maupun di wilayah daratan Kabupaten Buton masih sangat baik, namun, tingkat mortalitas cempe di wilayah kepulauan masih relatif tinggi.Kata Kunci: Kambing Kacang, Performans, Kid Crop, Mortalitas, Lapandewa, Siompu


Author(s):  
Arina Anis Azlan ◽  
Mohammad Rezal Hamzah ◽  
Jen Sern Tham ◽  
Suffian Hadi Ayub ◽  
Abdul Latiff Ahmad ◽  
...  

Health literacy is progressively seen as an indicator to describe a nation’s health status. To improve health literacy, countries need to address health inequalities by examining different social demographic factors across the population. This assessment is crucial to identify and evaluate the strengths and limitations of a country in addressing health issues. By addressing these health inequalities, a country would be better informed to take necessary steps to improve the nation’s health literacy. This study examines health literacy levels in Malaysia and analyses socio-demographic factors that are associated with health literacy. A cross-sectional survey was carried out using the HLS-M-Q18 instrument, which was validated for the Malaysian population. Multi-stage random sampling strategy was used in this study, utilising several sampling techniques including quota sampling, cluster sampling, and simple random sampling to allow random data collection. A total of 855 respondents were sampled. Our results showed that there were significant associations between health literacy and age, health status, and health problems. Our findings also suggest that lower health literacy levels were associated with the younger generation. This study’s findings have provided baseline data on Malaysians’ health literacy and provide evidence showing potential areas of intervention.


2021 ◽  
Author(s):  
Arina Anis Azlan ◽  
Mohammad Rezal Hamzah ◽  
Tham Jen Sern ◽  
Suffian Hadi Ayub ◽  
Abdul Latiff Ahmad ◽  
...  

AbstractHealth literacy is progressively seen as an indicator to describe a nation’s health status. To improve health literacy, countries need to address health inequalities by examining different social demographic factors across the population. This assessment is crucial to identify and evaluate strengths and limitations of a country in addressing health issues. By addressing these health inequalities, a country would be better informed to take necessary steps to improve the nation’s health literacy. This study examines health literacy levels in Malaysia and analyses socio-demographic factors that are associated with health literacy. A cross-sectional survey was carried out using the HLS-M-Q18 instrument which was validated for the Malaysian population. Multi-stage random sampling strategy was used in this study utilising several sampling techniques including quota sampling, cluster sampling and simple random sampling to allow random data collection. A total of 855 respondents were sampled. Results found significant associations between health literacy and age, health status and health problems. Findings also suggest that lower health literacy levels were found to be associated with the younger generation. The findings of this study have provided baseline data of the health literacy of Malaysians and provide evidence toward potential areas of intervention.


Sign in / Sign up

Export Citation Format

Share Document