scholarly journals OCA: overlapping clustering application unsupervised approach for data analysis

Author(s):  
Alvincent Egonia Danganan ◽  
Ariel M. Sison ◽  
Ruji P. Medina

<p>In this paper, a new data analysis tool called Overlapping Clustering Application (OCA) was presented. It was developed to identify overlapping clusters and outliers in an unsupervised manner. The main function of OCA is composed of three phases. The first phase is the detection of the abnormal values(outliers) in the datasets using median absolute deviation. The second phase is to segment data objects into cluster using k-means algorithm. Finally, the last phase is the identification of overlapping clusters, it uses maxdist (maximum distance of data objects allowed in a cluster) as a predictor of data objects that can belong to multiple clusters.  Experimental results revealed that the developed OCA proved its capability in detecting overlapping clusters and outliers accordingly.</p>

2021 ◽  
Vol 10 (4) ◽  
pp. 2212-2222
Author(s):  
Alvincent E. Danganan ◽  
Edjie Malonzo De Los Reyes

Improved multi-cluster overlapping k-means extension (IMCOKE) uses median absolute deviation (MAD) in detecting outliers in datasets makes the algorithm more effective with regards to overlapping clustering. Nevertheless, analysis of the applied MAD positioning was not considered. In this paper, the incorporation of MAD used to detect outliers in the datasets was analyzed to determine the appropriate position in identifying the outlier before applying it in the clustering application. And the assumption of the study was the size of the cluster and cluster that are close to each other can led to a higher runtime performance in terms of overlapping clusters. Therefore, additional parameters such as radius of clusters and distance between clusters are added measurements in the algorithm procedures. Evaluation was done through experimentations using synthetic and real datasets. The performance of the eHMCOKE was evaluated via F1-measure criterion, speed and percentage of improvement. Evaluation results revealed that the eHMCOKE takes less time to discover overlap clusters with an improvement rate of 22% and achieved the best performance of 91.5% accuracy rate via F1-measure in identifying overlapping clusters over the IMCOKE algorithm. These results proved that the eHMCOKE significantly outruns the IMCOKE algorithm on mosts of the test conducted.


2021 ◽  
Vol 22 (2) ◽  
Author(s):  
Chiheb Eddine Ben Ncir

Overlapping clustering is an important challenge in unsupervised learning applications while it allows for each data object to belong to more than one group. Several clustering methods were proposed to deal with this requirement by using several usual clustering approaches. Although the ability of these methods to detect non-disjoint partitioning, they fail when data contain groups with arbitrary and non-spherical shapes. We propose in this work a new density based overlapping clustering method, referred to as OC-DD, which is able to detect overlapping clusters even having non-spherical and complex shapes. The proposed method is based on the density and distances to detect dense regions in data while allowing for some data objects to belong to more than one group.Experiments performed on articial and real multi-labeled datasets have shown the effectiveness of the proposed method compared to the existing ones.


2019 ◽  
Vol 4 (1) ◽  
pp. 111-118
Author(s):  
Inda Lestari ◽  
Miguna Astuti ◽  
Hariyanto Ridwan

This research was conducted to analyze the effect of innovation orientation and entrepreneurship on the competitive advantage of culinary MSMEs in the Cilandak Barat area, South Jakarta. The population used for this study was 36 actors in the culinary field of SMEC. The sampling technique uses a saturated sampling method. The data analysis tool used is PLS 3.0. The results of this study indicate that the innovation variable has a significant influence on the culinary competitiveness of SMEC. And, entrepreneurial orientation has a significant influence on the culinary competitiveness of SMEC. The researcher suggests SMECs to pay attention to other factors that can influence competitive advantage. Keywords: Innovation, Entrepreneurship Orientation, Competitive Advantage


2021 ◽  
Vol 13 (13) ◽  
pp. 7347
Author(s):  
Jangwan Ko ◽  
Seungsu Paek ◽  
Seoyoon Park ◽  
Jiwoo Park

This paper examines the main issues regarding higher education in Korea—where college education experienced minimal interruptions—during the COVID-19 pandemic through a big data analysis of news articles. By analyzing policy responses from the government and colleges and examining prominent discourses on higher education, it provides a context for discussing the implications of COVID-19 on education policy and what the post-pandemic era would bring. To this end, we utilized BIgKinds, a big data research solution for news articles offered by the Korea Press Foundation, to select a total of 2636 media reports and conducted Topic Modelling based on LDA algorithms using NetMiner. The analyses are split into three distinct periods of COVID-19 spread in the country. Some notable topics from the first phase are remote class, tuition refund, returning Chinese international students, and normalization of college education. Preparations for the College Scholastic Ability Test (CSAT), contact and contactless classes, preparations for early admissions, and supporting job market candidates are extracted for the second phase. For the third phase, the extracted topics include CSAT and college-specific exams, quarantine on campus, social relations on campus, and support for job market candidates. The results confirmed widespread public attention to the relevant issues but also showed empirically that the measures taken by the government and college administrations to combat COVID-19 had limited visibility among media reports. It is important to note that timely and appropriate responses from the government and colleges have enabled continuation of higher education in some capacity during the pandemic. In addition to the media’s role in reporting issues of public interest, there is also a need for continued research and discussion on higher education amid COVID-19 to help effect actual results from various policy efforts.


2020 ◽  
Vol 2 (3) ◽  
Author(s):  
Wilda Novita Sari ◽  
Ariusni Ariusni

Abstract: The purpose of this research is to be able to determine the effect of world oil prices on economic growth in Indonesia by applying the exchange rate moderating variable and the BI rate as a connecting variable. Descriptive and associative research is a type of research that is used with data collection techniques through a trusted official agency website that is classified in the quarterly time series secondary data. The data year in this study was from 2006 to 2018. Data analysis was carried out through descriptive and inductive analysis with a Moderated Regression Analysis (MRA) data analysis tool accompanied by a classic assumption test and a t test. Estimation results show that there are two research results; firstly, that the exchange rate has an effect on moderating the relationship between world oil prices and economic growth in Indonesia, secondly, that the BI rate has no influence connecting world oil prices and economic growth in Indonesia. Keywords: World oil prices, economic growth, exchange rates, BI rate, Moderated Regression Analysis (MRA).


2019 ◽  
Vol 6 (2) ◽  
pp. 93
Author(s):  
Frans Christiyanto

The purpose of this paper is to analyze the effect of variable communication, resources, disposition and organizational structure for program implementation RPJMD West Kutai 2011-2015, either partially or simultaneously. Type of this research is quantitative research. The analysis tool used is multiple linear regression. In this study using survey methods explanation (explanatory survey method) is a survey that explains the variables under study and further analyze the influence between variables accompanied by hypothesis testing. This research was conducted by collecting qualitative data, which will then be presented in the form of numbers (quantified) to be tested in accordance with the design verification of data analysis. The results showed the coefficient of determination (R2) of 0.421. There is significant influence between independent variables namely communication, resources, disposition and organizational structure for program implementation RPJMD West Kutai 2011-2015.Keyword: Implementation RPJMD, Communication, Resources


2020 ◽  
Vol 5 (2) ◽  
pp. 227
Author(s):  
Arniwita Arniwita ◽  
Deka Veronica ◽  
Ahmad Soleh

The Human Development Index (HDI) is an index to measure human achievement and is one of the indicators used in looking at people's well-being in a region. The higher the HDI value in a region, the better the level of welfare in the region. So often HDI is considered to have been able to represent the welfare level of the population, because in the HDI includes elements that include economic and noneconomic variables. Non-economic variables are measured from the level of public education and the degree of public health. While economic variables are measured from income levels indicating people's purchasing power, the three are related to each other. However, if you look at the conditions in Jambi Province, there is an interesting phenomenon where the development of the government does not or lack a real impact on the improvement of the Human Development Index (HDI), so it is necessary to do this research. The purpose of this study is to analyze the inequality, influence and relationship of the variables of the human development index which includes Gross Regional Domestic Product (GRDP) per capita, the number of medical personnel, the number of basic health facilities, the number of poor people as well as the number of teachers in public elementary schools as dependent variables with the human development index (HDI) as dependent variables. The data analysis method used in this study is a qualitative and qualitative descriptive method of explanatory properties, using sekuder data in the period 2008-2017. The data analysis tool used in this study uses the usual Weighted Coefficient of Variation (CVw) method for the first problem, the subsequent regression of the data panel for the second problem and the person correlation for the third problem. The hypothesis test in this study shows that there is inequality in IPM-forming variables in Jambi Province, further influence and significant relationship between ipm-forming variable inequality and HDI in Jambi Province.


2019 ◽  
Vol 6 (3) ◽  
pp. 285
Author(s):  
Muhammad Nawawi ◽  
Ahmad Alim Bachri ◽  
Dahniar ,

<p><em>The Purpose of the research to identify and analyze the effect of Remunerasi and Motivasion Work about Performance ol Civil Servants (PNS) simultaneously and partially on the Performance.</em></p><p><em>The method used is quantitative research that is explanatory, with a population of 191 people, technique sampling used is stratified random sampling with a sample of 66 people who serve as respondents, data analysis tool used is multiple linear regression using SPSS software ver.16.0.</em></p><em>The results ofthis studyconcludedthat the remunerationandmotivation to worksimultaneously and partiallypositive effect onperformance.Of the threeindepend- entvariable, the highest percentagewas97,8% remuneration(X1) compared with the twoother variables. This isbecausethatis essentiallyan employeeis unable to performanythingin general, there needs to becompensationimpetusin drivingdirectionoptimal work, compensationandjob satisfactionpolicyimplementationneed tobe included in thetraining sobeberpapositive influenceon employee performance</em>


2021 ◽  
Vol 5 (2) ◽  
pp. 263-273
Author(s):  
Ade Lia ◽  
Ibdalsyah Ibdalsyah ◽  
Hilman Hakiem

This study aims to determine the effect of consumer perceptions, halal labeling and brand image on purchasing decisions, while the independent variables are consumer perceptions, halal labeling and brand image. The data in this study were collected through questionnaires distributed to consumers who had purchased and used SR12 herbal skincare products in Bogor. The research method used is quantitative. The population in this study were consumers of SR12 herbal skincare products. With the data collected amounted to 100 respondents. The data analysis tool used in this study used multiple linear regression. The results of this study indicate that the variables of consumer perception, halal labeling and brand image have a positive and significant effect on purchasing decisions for sr12 herbal skincare products. Keywords: Consumer Perception, Halal Labeling, Brand Image and Purchase Decision


Sign in / Sign up

Export Citation Format

Share Document