Performance evaluation of Distance based Angular Clustering Algorithm (DACA) using data aggregation for heterogeneous WSN

Author(s):  
Navjot Kumar ◽  
Surinder Kaur
2018 ◽  
Vol 3 (1) ◽  
pp. 001
Author(s):  
Zulhendra Zulhendra ◽  
Gunadi Widi Nurcahyo ◽  
Julius Santony

In this study using Data Mining, namely K-Means Clustering. Data Mining can be used in searching for a large enough data analysis that aims to enable Indocomputer to know and classify service data based on customer complaints using Weka Software. In this study using the algorithm K-Means Clustering to predict or classify complaints about hardware damage on Payakumbuh Indocomputer. And can find out the data of Laptop brands most do service on Indocomputer Payakumbuh as one of the recommendations to consumers for the selection of Laptops.


Mathematics ◽  
2021 ◽  
Vol 9 (5) ◽  
pp. 469
Author(s):  
Chia-Nan Wang ◽  
Thi-Ly Nguyen ◽  
Thanh-Tuan Dang ◽  
Thi-Hong Bui

In Vietnam, fishing is a crucial source of nutrition and employment, which not only affects the development of the domestic economy but is also closely related to exports, heavily influencing the economy and foreign exchange. However, the Vietnamese fishery sector has been facing many challenges in innovating production technology, improving product quality, and expanding markets. Hence, the fishery enterprises need to find solutions to increase labor productivity and enhance competitiveness while minimizing difficulties. This study implemented a performance evaluation from 2015 to 2018 of 17 fishery businesses, in decision making units (DMUs), in Vietnam by applying data envelopment analysis, namely the Malmquist model. The objective of the paper is to provide a general overview of the fishery sector in Vietnam through technical efficiency, technological progress, and the total factor productivity in the four-year period. The variables used in the model include total assets, equity, total liabilities, cost of sales, revenue, and profit. The results of the paper show that Investment Commerce Fisheries Corporation (DMU10) and Hoang Long Group (DMU8) exhibited the best performances. This paper offers a valuable reference to improve the business efficiency of Vietnamese fishery enterprises and could be a useful reference for related industries.


Author(s):  
Ming Zhang ◽  
Nishant Kukadia

There is growing interest in incorporating urban form indicators into transportation planning and travel analysis. These indicators typically are measured at a certain level of spatial aggregation (e.g., traffic analysis zone) and therefore are subject to the modifiable areal unit problem (MAUP) known primarily in the statistical and geographic literature but generally overlooked by transportation researchers. The presence of the MAUP can cause serious inconsistency in analytical results and consequently misinform policy making. This study diagnoses the MAUP in measuring urban form through empirical modeling of travel mode choice in the Boston, Massachusetts, region. Using data aggregated in grids with five cell sizes and at the transportation analysis zone, the census block group, and the block level, the study explores the sensitivity of coefficient estimates for population density, network pattern, and land use balance to data aggregation in predicting mode choice decisions. Having confirmed the presence of the MAUP, the study discusses three approaches for dealing with it. Using a grid with a cell size of 1/2 mi appears to be the most desirable method of data aggregation among the eight methods studied. The suggested improvements in methodology will help advance the inquiry on the link between urban form and travel.


2020 ◽  
Vol 500 (1) ◽  
pp. 1323-1339
Author(s):  
Ciria Lima-Dias ◽  
Antonela Monachesi ◽  
Sergio Torres-Flores ◽  
Arianna Cortesi ◽  
Daniel Hernández-Lang ◽  
...  

ABSTRACT The nearby Hydra cluster (∼50 Mpc) is an ideal laboratory to understand, in detail, the influence of the environment on the morphology and quenching of galaxies in dense environments. We study the Hydra cluster galaxies in the inner regions (1R200) of the cluster using data from the Southern Photometric Local Universe Survey, which uses 12 narrow and broad-band filters in the visible region of the spectrum. We analyse structural (Sérsic index, effective radius) and physical (colours, stellar masses, and star formation rates) properties. Based on this analysis, we find that ∼88 per cent of the Hydra cluster galaxies are quenched. Using the Dressler–Schectman test approach, we also find that the cluster shows possible substructures. Our analysis of the phase-space diagram together with density-based spatial clustering algorithm indicates that Hydra shows an additional substructure that appears to be in front of the cluster centre, which is still falling into it. Our results, thus, suggest that the Hydra cluster might not be relaxed. We analyse the median Sérsic index as a function of wavelength and find that for red [(u − r) ≥2.3] and early-type galaxies it displays a slight increase towards redder filters (13 and 18 per cent, for red and early type, respectively), whereas for blue + green [(u − r)<2.3] galaxies it remains constant. Late-type galaxies show a small decrease of the median Sérsic index towards redder filters. Also, the Sérsic index of galaxies, and thus their structural properties, do not significantly vary as a function of clustercentric distance and density within the cluster; and this is the case regardless of the filter.


2021 ◽  
Author(s):  
Xin Sui ◽  
Wanjing Wang ◽  
Jinfeng Zhang

In this work, we trained an ensemble model for predicting drug-protein interactions within a sentence based on only its semantics. Our ensembled model was built using three separate models: 1) a classification model using a fine-tuned BERT model; 2) a fine-tuned sentence BERT model that embeds every sentence into a vector; and 3) another classification model using a fine-tuned T5 model. In all models, we further improved performance using data augmentation. For model 2, we predicted the label of a sentence using k-nearest neighbors with its embedded vector. We also explored ways to ensemble these 3 models: a) we used the majority vote method to ensemble these 3 models; and b) based on the HDBSCAN clustering algorithm, we trained another ensemble model using features from all the models to make decisions. Our best model achieved an F-1 score of 0.753 on the BioCreative VII Track 1 test dataset.


Sign in / Sign up

Export Citation Format

Share Document