Xplore Journal of Statistics
Latest Publications


TOTAL DOCUMENTS

73
(FIVE YEARS 29)

H-INDEX

1
(FIVE YEARS 0)

Published By Institut Pertanian Bogor

2302-5751, 2302-5751

2021 ◽  
Vol 10 (3) ◽  
pp. 258-269
Author(s):  
Muh Nur Fiqri Adham ◽  
Budi Susetyo ◽  
Kusman Sadik ◽  
Satriyo Wibowo

Accreditation is an indicator of the quality of education at the education unit level. One affects the quality of education units is the school budget. School budgets are prepared in order to fulfill 8 national education standards. School budget management uses School Activity Plan and Budget Application (ARKAS) developed by the Ministry of Education, Culture, Research and Technology (Kemendikbudristek). ARKAS is an information system for managing school budget and expenditure planning. The Research is identifies the factors that influence the accreditation of high school (SMA) with accreditation as a response variable and 17 explanatory variables sourced from ARKAS and Dapodik data using ordinal logistic regression analysis. The best model stage is the model formed that has the smallest AIC value and has high model accuracy in determining the best model. The best model stage is the third model stage which is composed of 7 explanatory variables that affect the high school accreditation rating with AIC value of 1886,20 and model accuracy of 65,79%. The variables that affect to results of accreditation include school status, percentage of students eligible PIP, ratio of the number of students per number of teachers, percentage of teachers certified educators, ratio of the number of students per number of study groups, ratio of the number of students per number of computers, and ratio of the number of students per number of toilets


2021 ◽  
Vol 10 (3) ◽  
pp. 226-236
Author(s):  
Khusnul Khotimah ◽  
Itasia Dina Sulvianti ◽  
Pika Silvianti

The number of leper in West Java is an example of the count data case. The analyzes commonly used in count data is Poisson regression. This research will determine the variables that influence the number of leper in West Java. The data used is the number of leper in West Java in 2019. This data has an overdispersion condition and spatial heterogenity. To handle overdispersion, the negative binomial regression model can be employed. While spatial heterogenity is overcome by adding adaptive bisquare kernel weight. This research resulted Geographically Weighted Negative Binomial Regression (GWNBR) with a weighting adaptive bisquare kernel classifies regency/city in West Java into ten groups based on the variables that sigfinicantly influence the number of leper. In general, the variable in the percentage of households with Clean and Healthy Behavior (PHBS) has a significant effect in all regency/city in West Java. Especially for Bogor Regency, Depok City, Bogor City, and Pangandaran Regency, the variable of the percentage of people poverty does not have a significant effect on the number leper.


2021 ◽  
Vol 10 (3) ◽  
pp. 288-301
Author(s):  
Tri Wahyuni ◽  
Indahwati Indahwati ◽  
Kusman Sadik

DKI Jakarta is the center of the spread of Covid-19. This is indicated by the higher cumulative number of Covid-19 positive in DKI Jakarta compared to other provinces. The high number of cases in DKI Jakarta is a concern for all groups, so it is necessary to do forecasting to predict the number of Covid-19 positive in the next period. Accurate forecasting is needed to get better results. This study compares the Autoregressive Integrated Moving Average (ARIMA) and Artificial Neural Networks (ANN) methods in predicting the number of Covid-19 positive in DKI Jakarta. Forecasting accuracy is calculated using the value of Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), and correlation. The results show that the best model for forecasting the number of Covid-19 positive in DKI Jakarta is ARIMA(0,1,1) with drift, with a MAPE value of 15.748, an RMSE of 268.808, and the correlation between the forecast value and the actual value of 0.845. Forecasting using ARIMA(0,1,1) with drift and BP(3,10,1) models produces the best forecast for the long forecasting period of the next six weeks.


2021 ◽  
Vol 10 (3) ◽  
pp. 237-247
Author(s):  
Ni Kadek Manik Dewantari ◽  
Utami Dyah Syafitri ◽  
Aam Alamudi

New student admissions are opened in three pathways including SNMPTN, SBMPTN, and Seleksi Mandiri. In order to improve the SNMPTN selection system at IPB, a study was conducted on the quality of SMA/MA which registered to IPB through school clustering. In general, cluster analysis cannot handle large and mixed-type data, so this school clustering used the Two-Step Cluster method with two alternatives, namely without handling outliers and handling 5 percent outliers. Both of these alternatives produced an average Silhouette coefficient value of 0.2 and 0.3 respectively, which was still under the good category. However, clustering without handling outliers resulted in more detailed cluster criteria with 4 optimal clusters. The criteria for these four clusters include, Cluster 1 is a category of Low Commitment, Low Quality, and Low Consistency schools, Cluster 2 and 3 are categories of schools that have special criteria in certain categories, and Cluster 4 is a category of High Commitment, High Quality, and High Consistency.


2021 ◽  
Vol 10 (3) ◽  
pp. 270-287
Author(s):  
Rifannisa Bahar ◽  
Pika Silvianti ◽  
Budi Susetyo

Mapping the quality of education in Indonesia needs to be studied so that the provincial government, as the institution responsible for secondary education management policies, can more easily determine priorities and what actions will be taken to improve the quality of education in Indonesia. One of the analytical methods that can be used to map the quality of education is fuzzy c-means. This research aims to classify the quality maps of provinces in Indonesia based on the results of SHS/MA accreditation using the fuzzy c-means method. The fuzzy c-means method can show the probability of objects entering a cluster with a degree of membership. The optimum cluster sizes obtained were 2 and 3. The final solution with cluster size 2 was 12 provinces categorized in cluster 1 and 22 provinces categorized in cluster 2. Clustering with cluster size 3 resulted in cluster 1 consisting of 11 provinces, cluster 2 consisting of 16 provinces, and cluster 3, which consists of 7 provinces. The main character of cluster 1 is a high national education standard score, while the main character of cluster 2 is a low national education standard score. Then the main character of group 3 is the national standard score, whose value is around the national average.


2021 ◽  
Vol 10 (3) ◽  
pp. 248-257
Author(s):  
Karel Fauzan Hakim ◽  
Pika Silvianti ◽  
Agus Mohammad Soleh

Covid-19 is a very troubling disease in Indonesia. Therefore, understanding public opinion is required to find solutions and evaluate the government performance in handling the pandemic. Twitter can be helpful to identify the public opinion of significant events. Twitter’s tweet is a large dimension text-based big data. It requires text sampling and text mining to be processed efficiently and effectively. Stratified random sampling with 20 repetitions applied to assume days as strata followed by topic modeling with latent Dirichlet allocation (LDA). This research aims to find out public opinion regarding Covid-19 and itsgrowth over time. Other than that, this research also aims to find out sampling effects on tweet data using stratified random sampling. Therefore, the extracted topics will be transformed into time-series data and considering the variety of the pattern made. Afterward, the transformation results will be explored and interpreted. This research suggests that discussions related to Covid-19 are divided into four topics by the first model, namely: “Vaccine”, “Positive or affected people”, “Health protocol”, and “Indonesia” then nine topics by the second model, namely: “Vaccine”, “Prayer”, “Health protocol”, “Social aid and corruption”, “Affected people”, “Indonesian economy”, “Work”, “Persuading to wear mask”, and “Willing to watch”. Furthermore, some topics peak whenever a significant event occurs in Indonesia. Afterward, this research suggests that 20 repetitions of stratified random sampling could provide good results.


2021 ◽  
Vol 10 (3) ◽  
pp. 214-225
Author(s):  
Rahma Dany Asyifa ◽  
Agus M Soleh ◽  
Bagus Sartono

Application development must be done by considering the usability factor of the application. Three aspects of usability measurement, namely usefulness, satisfaction, and ease of use, are latent variables that cannot be measured directly, so the appropriate analysis is the Structural Equation Model-Partial Least Square (SEM-PLS). PLS is a SEM analysis approach that does not require assumptions of data distribution and a minimum number of observations. The measurement of the usability of the Thymun application is described in two SEM-PLS models. This study aims to determine the best model and determine the effect of usefulness, satisfaction, and ease of use on the usability of the Thymun application. The data used is survey data to 44 Thymun application users. The sampling technique used was purposive sampling. The results showed that the best model has a good measure with an R-square value of 0.730 and Q2 0.453 with a Goodness of Fit 0.736. The variables of usefulness and ease of use have a significant effect on the 5% real level with path coefficient values ​​of 0.255 and 0.636. While the satisfaction variable does not have a significant effect on the 5% real level with a path coefficient of 0.058. Thymun application usability score is 76.47.


2021 ◽  
Vol 10 (2) ◽  
pp. 140-151
Author(s):  
Eka Setiawaty ◽  
Farit Mochamad Afendi ◽  
Cici Suhaeni

Increased competition between personal vehicle dealers make them need strategies to hold their customers and increase their sales. One of the strategies they could apply is prospecting their customers at the right time. We could predict the right time by identifying the relationship between the length of their purchase time and its factors based on the transaction data of Z Company from year 2002 to 2015 using Classification and Regression Trees (CART). Data analysis is separated between groups of customers who made the second purchase maximum of 10 years after the first purchase (group A) and more than 10 years after the first purchase (group B). Group A’s regression tree produces 8 terminal nodes with MAD value 1.84 years. The independent variables that plays a role are tenor, job, age, and brand. Group B’s regression tree produces 4 terminal nodes. Authorized service and job come out as independent variables which affect the splitting process. MAD value for Group B’s regression tree is 0.56 years.


2021 ◽  
Vol 10 (2) ◽  
pp. 167-181
Author(s):  
Noer Endah Islami ◽  
Utami Dyah Syafitri ◽  
Cici Suhaeni

In order to lead in the market, companies should have an innovation product. Before the innovation product lauch to the market, the marketing research should be done. The goal of the reasearch is to determine whether the new product is accepted or rejected in the market. This study was to identify the characteristics of the new product based on organoleptic point of view and performance the three type of new multivitamin products based on location and social economic classes (SEC) of respondents. MANOVA and biplot analysis were used in this research. Based on MANOVA, there were differences on the organoleptic point of view of respondents among three type of new multivitamin products. The three products had differences on the assessment of aroma, sour taste, and sour after taste. In addtion with biplot analysis, it was concluded that each product had different location for sale and the target of respondents based on sosial economic classes. According to respondents, product A was too sweet taste and too sour after taste in the mouth compared to others. This product was preferred by respondents who reside in South Jakarta with social economic classes (SEC) A2 and C1. Unlike product A, product B was too hard with a bit of bitter after taste in the mouth. This product was relatively preferred by respondents in various residential with social economic classes (SEC) B. Product C was strong aroma with smooth texture and more bitter taste than others. This product was preferred by respondents who reside in North Jakarta and Depok with social economic classes (SEC) A1. Overall, product B was preferred by respondents compared to other products.


2021 ◽  
Vol 10 (2) ◽  
pp. 182-196
Author(s):  
Nadya Amelia Dewi Suryana ◽  
Itasia Dina Sulvianti ◽  
Muhammad Nur Aidi

Water is an important factor in fulfilling the needs of living things, therefore the water that is used must be free from bacterias and do not contain any toxic substances. The most common water source comes from the river. Ciliwung River as one of the main rivers used for drinking, household needs, industrial needs, and transportation must have good water quality. Therefore, the Ciliwung River water quality needed to be known. The water quality is measured based on the parameters such as the physical water quality and the chemical water quality. The measurement of those parameters are classified to be complicated as it measured by laboratorium research, so that the identification of the chemical water quality parameter could be done through the physical water quality that is easier and simpler to be measured. This study aims to determine the variable of the physical water parameters that can be used to identify the chemical water quality parameters, so that the water quality of the Ciliwung River can be known in a simpler way. Statistical method that can be used to see the relationship between the two variable groups is the canonical correlation analysis. Canonical correlation analysis is a method in multiple variable analysis used to investigate the relationship of two groups of variables using the linear combination principle of the two variables. Based on the results of the canonical correlation analysis, it can be concluded that there is a relationship between the physical quality of water and the chemical quality of water. The correlation exists between the variables of physical quality of water, which are the water temperature and the content of suspended substances in water, with the variables of chemical quality of water, namely groups of metals (manganese levels in water and iron content in water) and groups of acid (the level of deep phosphate in water, the level of sulfate in water, the level of nitrite in water, and the level of nitrate in water). The relationship between the physical quality of water is positive between the temperature of water and the chemical quality of water whereas negative between the levels of suspended substances in water and the chemical quality of water.


Sign in / Sign up

Export Citation Format

Share Document